Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Sign In to gain access to subscriptions and/or personal tools.
RELC Journal
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Kuo, C.-H.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Can Numbers Talk? Basic Data Management of a Corpus

Chih-Hua Kuo

National Chiao Tung University

This study attempts a series of quantitative analyses on a cornucopia of data in the Corpus of Scientific Journal Articles (CSJA), a special- purpose corpus consisting of 360 journal articles in 10 major scientific fields. Major findings include: (1) the average word length is 6.31 characters;(2) a word-form occurs 36.8 times on average;(3) a text category having a larger number of running words tends to have a higher word recurrence rate; (4) most of the 100 most frequent word-forms are function words; (5) in comparison with the COBUILD corpus and the LOB corpus, numbers and letters are much more frequently used in the CSJA than in the other two corpora; (6) only a very limited number of word-forms have a high recurrence rate while more than half of the vocabulary occur only once or twice; (7) despite disciplinary difference, word frequency profiles of the ten scientific fields are very similar, showing that different scientific fields bear similar patterns in the use of words.

RELC Journal, Vol. 30, No. 1, 1-17 (1999)
DOI: 10.1177/003368829903000101


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Language Teaching ResearchHome page
E. Gatbonton
Looking beyond teachers' classroom behaviour: Novice and experienced ESL teachers' pedagogical knowledge
Language Teaching Research, April 1, 2008; 12(2): 161 - 182.
[Abstract] [PDF]