基于大规模语料的英语词汇重复率研究

摘要
图/表
参考文献
相关文章 (15)

全文: PDF (0 KB) HTML (1 KB)
输出: BibTeX | EndNote (RIS)

摘要摘要：本研究将英国国家语料库(BNC)和美国国家语料库(ANC)大规模海量笔语语料随机分为60个实验组和41个检验组，总计83,864个语篇对，通过计算机编程的手段对英语词汇重复率进行动态分析。建立了估算词汇重复率的数学模型，并运用60个实验组对此公式进行了检验。研究发现，词汇重复率曲线的分布较有规律，极值较少；词汇重复率变化曲线为非线性；词汇重复率预测公式误差较小，可以用于估算不同长度的真实语篇英语词汇重复率的理论数值。

	服务

	把本文推荐给朋友
	加入我的书架
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	赵小东冯志伟

Abstract：Abstract This research randomly divides large-scale written British National Corpus (BNC) and American National Corpus (ANC) into the experimental set and test set, with the former containing 60 samples and the latter 41 samples, totaling 83,864 pairs of texts. A dynamic analysis is made to study the English vocabulary repeat rate by means of computer programs. A mathematic model to calculate vocabulary repeat rate is established and then tested based on the 60 samples in the experimental set. Results show that the distribution curves for vocabulary repeat rates are nonlinear and regular, with only a few outliers；the inferred formula experiences a very small margin of error in the calculation of theoretical repeat rate, and can be used to estimate the theoretical values of vocabulary repeat rate for authentic English texts of different lengths.

引用本文:

赵小东冯志伟. 基于大规模语料的英语词汇重复率研究 [J]. 外语与外语教学, 2016, 01(04): 87-.

链接本文:

http://112.126.70.247/wy/CN/ 或 http://112.126.70.247/wy/CN/Y2016/V01/I04/87