A Genre Analysis of Chinese and English Abstracts of Academic Journal Articles: A Parallel-Corpus-ba...
Download
1 / 25

NIU Gui-ling School of Foreign Languages, Zhengzhou University [email protected] - PowerPoint PPT Presentation


  • 123 Views
  • Uploaded on

A Genre Analysis of Chinese and English Abstracts of Academic Journal Articles: A Parallel-Corpus-based Study. NIU Gui-ling School of Foreign Languages, Zhengzhou University [email protected] Contents. Brief Introduction of Chinese-English Parallel Abstract Corpus (CEPAC)

loader
I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.
capcha
Download Presentation

PowerPoint Slideshow about ' NIU Gui-ling School of Foreign Languages, Zhengzhou University [email protected]' - tirza


An Image/Link below is provided (as is) to download presentation

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.


- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

A Genre Analysis of Chinese and English Abstracts of Academic Journal Articles: A Parallel-Corpus-based Study

NIU Gui-ling

School of Foreign Languages, Zhengzhou University

[email protected]


Contents
Contents Academic Journal Articles: A Parallel-Corpus-based Study

  • Brief Introduction of Chinese-English Parallel Abstract Corpus (CEPAC)

    • 教育部人文社会科学研究规划基金项目“基于平行语料库的中外学术期刊论文中英文摘要的语类分析研究”

  • A Genre Analysis of Chinese and English Abstracts of Academic Journal Articles


Why ra abstract
Why RA Abstract? Academic Journal Articles: A Parallel-Corpus-based Study

  • Introduced into medical research articles first during the 1960s (Swales, 2010) .

  • Underresearched: In 2005 (Montesi & Urdiciain): barely 28 studies regarding research article (RA) abstracts then.

  • Now: A sea of information, “an information explosion”, with several million research papers being published each year, and continual announcements of new journals being launched, either online or in hard copy or both.

  • RA abstracts “have become a tool of mastering and managing the ever increasing information flow in the scientific community” (Ventola 1994a: 333), and developed into an increasingly important part-genre (Swales 2009).


Objective
objective Academic Journal Articles: A Parallel-Corpus-based Study

  • MOVEs and steps: differentiate from research articles as an independent genre in rhetorical MOVE structure.

  • To generalize and illustrate the generic structural potential (GSP) of abstracts by using the advantage of big texts of corpus and analyzing the usage of rhetorical MOVE in them, with genre analysis and schematic structure as the theoretical basis.


What is an abstract
What is An Abstract? Academic Journal Articles: A Parallel-Corpus-based Study

  • Cleveland (1983:104): An abstract summarizes the essential contents of a particular knowledge record and is a true surrogate of the document.

  • Graetz (1985): The abstract is a time-saving device that can be used to find particular parts of the article without reading it; … knowing the structure in advance will help the reader to get into the article; …. An abstract should be concise and precise, indicating to the potential reader two things: (a) what was done, and (b) important results obtained.


Functions
Functions Academic Journal Articles: A Parallel-Corpus-based Study

  • Huckin (2001): 4 distinguishable functions: ⑴They function as stand-alone mini-texts, giving readers a short summary of a study’s topic, methodology and main findings; ⑵They function as screening devices, helping readers decide whether they wish to read the whole article or not; ⑶They function as previews for readers intending to read the whole article, giving them a road-map for their reading; ⑷They provide indexing help for professional abstract writers and editors;

  • ⑸They provide reviewers with an immediate oversightof the paper they have been asked to review.(Swales, 2010)


未来工作 Academic Journal Articles: A Parallel-Corpus-based Study

Future Work

数据提取

Data retrieval

语料库标注

Annotation

应用分析

Application research

成功PPT四要素

Chinese-English Parallel

Abstract Corpus (CEPAC)

语料库构成

components

CEPAC


Overview of cepac corpus
Overview of CEPAC Corpus Academic Journal Articles: A Parallel-Corpus-based Study

3780texts

673246words/

characters

Corpus size

CC:1260 texts

CE:1260 texts

EE:1260 texts

CC:209889/376972w/c;

CE:221240w;EE:242117w.


Corpus structure
Corpus Structure Academic Journal Articles: A Parallel-Corpus-based Study

  • Three sub-corpora:

CC(Chinese) Corpus

CE(English) Corpus

Data Retrieval

Management & Text Update

EE(English) Corpus


Five main disciplines
Five main Disciplines Academic Journal Articles: A Parallel-Corpus-based Study

Under each of the three sub-corpora, five disciplinary categories are respectively established, For each category under the three sub-corpora, 252 abstracts were chosen. The sub-disciplines chosen in the three sub-corpora basically also conform to each other, ensuring the balance of texts.

HHealth Sciences

1

S Social Sciences and Humanities

2

3

P Physical Sciences and Engineering

4

L Life Science and Biomedical Sciences

YLanguage Sciences and Literature

5


Inter intra lingual interdisciplinary comparison
Inter-,Intra-lingual & Interdisciplinary Comparison Academic Journal Articles: A Parallel-Corpus-based Study


Annotation
Annotation Academic Journal Articles: A Parallel-Corpus-based Study

  • Macrostructure and Meaning of MOVEs and Labels (cited from: Swales & Feak, 2009)


Move attributes values
Move Attributes & Values Academic Journal Articles: A Parallel-Corpus-based Study


Tools

Wordsmith Academic Journal Articles: A Parallel-Corpus-based StudyTools,Antconc,

Self-disignedprogramms,

SPSS,etc

MMAX2

ICTCLAS & CLAWS

Editplus, Paraconc

TOOLS

Data Retrieval & Statistics

Annotation

POS Tagging

Text Processing,

Alignment


Annotation interface mmax2
Annotation Interface _ MMAX2 Academic Journal Articles: A Parallel-Corpus-based Study


Multi level annotation stand alone

POS Tagging Academic Journal Articles: A Parallel-Corpus-based Study

Move Level

Background

Purpose

Method

Results

Conclusion

  • Sentence pairs

  • (Alignment)

  • 1-1

  • 1-2

  • 1-3

  • 2-1

  • 3-1

  • 2-2

Syntax

Semantic

Translation

Chinese Word Segmentation

Multi-level Annotation(Stand-alone)


Data analysis
Data Analysis Academic Journal Articles: A Parallel-Corpus-based Study

  • Figure 3 Normalized Frequency of Move Distribution (Every 10,000 running words)


Completeness wholeness of moves in texts move number distribution
Completeness (Wholeness) of MOVEs in Texts: Move Number Distribution

  • Figure 2 MOVE Numbers and Percentage in Texts in Three Sub-corpora(unit:text)


Significant difference
Significant Difference Distribution

[The asterisks (*) indicate significance level, and the “+” and “-” signs on the right column indicate “overuse” and “underuse”.


Proportionality balance of moves in texts text coverage
Proportionality(Balance) of MOVEs in Texts:Text Coverage( Distribution文本覆盖率)

  • Independent MovesCompound Moves Other

  • Background Purpose MethodResultConclusion RegularIrregular

  • YCC 27.17%6.97%2.31%11.17%34.39% 10.01% 7.63% 0.34%

  • YCE 28.97% 10.41% 4.23% 11.83% 35.68% 4.45% 4.03% 0.40%

  • YEE 18.21%14.49%20.26%23.75%19.48% 2.87% 0.82% 0.11%

  • HCC 4.78% 11.39% 27.24% 37.21% 15.84% 3.35% 0.19%

  • HCE 5.44% 12.42% 27.71% 37.30% 16.49% 0.65%

  • HEE 20.00% 10.30% 16.36% 36.84% 15.48% 0.96% 0.06%

  • LCC 16.45% 6.94% 11.84% 38.33% 13.16% 8.76% 4.53%

  • LCE 17.52% 10.41% 15.89% 38.64% 13.30% 3.17% 0.97% 0.10%

  • LEE 17.67% 11.52% 12.50% 39.12% 15.21% 2.93% 0.92% 0.13%

  • PCC 10.47%7.77% 25.01% 23.11% 10.40% 21.50% 1.74%

  • PCE 12.96% 9.27% 30.69% 27.60% 11.88% 6.54% 1.05%

  • PEE 21.82%14.57% 17.47% 25.29% 18.61% 1.66% 0.57%

  • SCC 26.57%3.60%5.24% 23.08% 22.10% 13.69% 5.67% 0.05%

  • SCE 28.44% 5.83% 6.80% 24.65% 24.30% 6.59% 3.34% 0.05%

  • SEE 14.71%17.61%20.09% 24.43% 17.77% 3.41% 1.99%

    Average18.08% 10.23% 16.24% 28.16% 18.94% 6.04% 2.23% 0.08%

注:1. Text Coverage=该语步词数/语步所在摘要的总词数。2. 独立语步:一个句子中仅包含一种语步内容。

3. 复合语步:一个句子中包含两种或两种语步以上的内容。正序复合语步指一个句子中的复合语步按“背景-目的-方法-结果-结论”的先后顺序正常排列;

而逆序复合语步则指一个句子中的复合语步并非按上述顺序排象,譬如,方法语步置于目的语步之前,或结论语步置于结果语步之前。



Data query
Data Query Distribution


Data query kwic
Data Query_KWIC Distribution


Future work
Future Work Distribution

Data Retrieval & Application

Short-term Online Retrieval

Genre Analysis

Translation

Hedging

Grammar

Online Retrieval

Pragmatics

Semantics

Discourse Analysis

World Englishes

etc.


The End Distribution

Thank You!


ad