Calculating the Distance Between Near-Synonyms Across Languages: A Case Study on Chinese and Japanese
International Journal of Language and Linguistics
Volume 7, Issue 2, March 2019, Pages: 85-92
Received: Mar. 19, 2019;
Accepted: Apr. 26, 2019;
Published: May 15, 2019
Views 709 Downloads 85
Shi Jianjun, Institute of Linguistics, Shanghai International Studies University, Shanghai, China
Follow on us
The study of differences between near-synonyms across languages has always been a hot area of research in foreign language teaching and cross-language comparison. A linguistic phenomenon received special attention, Chinese-Japanese homographs often confuse learners since they have same forms yet differ slightly in terms of their meanings and usages. Traditional studies were unfolded mainly in two aspects: case studies on homograph discrimination and division of homographs according to meaning & usage distance. Researches concerning the latter aspect tend to distinguish near-synonyms between languages by means of a three-way classification, i.e. synonyms, heteronyms, and near-synonyms. However, this classification is far from satisfactory in that they cannot measure “near-synonymy” in an accurate and gradable manner since the term “near-synonymy” per se is too broad a term to define. This paper proposes a statistical method for calculating near-synonyms across languages by means of parallel corpus, where translation ratio, F-measure and inter-translation ratio are taken into account as parameters. By means of the F-value, this highly productive method is not only applicable in classifying synonyms, heteronyms, and near-synonyms between Chinese and Japanese, but also allows measuring the meanings & usage distance between cross-linguistic near-synonyms. To prove the effectiveness of this method, around 1900 pairs of Chinese and Japanese near-synonyms are compared and has gained good effects.
Near-Synonyms, Meaning & Usage Distance, Inter-Translation Ratio, F-measure
To cite this article
Calculating the Distance Between Near-Synonyms Across Languages: A Case Study on Chinese and Japanese, International Journal of Language and Linguistics.
Vol. 7, No. 2,
2019, pp. 85-92.
Copyright © 2019 Authors retain the copyright of this article.
This article is an open access article distributed under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/
) which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Shi J. J, Hong J. (2013) On a General Methodology for Contrastive Analysis of Chinese-Japanese Homographs. Foreign Language Teaching and Research, 45, pp 531-542.
Duan Y. G, Liu Y, Yu Sh. W. (2018) An Embedded Representation for "Tongyici Cilin" and Its Evaluation on Tasks, Journal of Xiamen University (Natural Science), 57(06), pp 867-875.
Li X. T, You Sh. J. Chen W. (2019), An Algorithm of Semantic Similarity between Words Based on Word Single-Meaning Embedding Model, Acta Automatica Sinica, DOI: 10.16383/j.aas.c180312.
Shi J. J. (2013) The Current Situation and Subjects of Synchronic Comparative Studies of Chinese-Japanese Homographs, Foreign Language Research in Northeast Asia, 1, pp 4-9.
The Agency for Cultural Affairs (1978) Kanji and their logographic Chinese counterparts, Research Materials for Japanese Language Education, pp 85-143.
Arakawa (1979) Chinese and Kanji: A Book Review on “Kanji and their logographic Chinese counterparts”, Aichi Forum on Language and Literature, 62, pp 388-361.
Oukouchi (1992) Homographs of Japanese and Chinese, Collection of Comparative Studies of Japanese and Chinese Languages, pp 179-215.
Shi J. J, Xu X. H. (2014) On Homographs in Chinese and Japanese Languages: a revisit. Journal of PLA University of Foreign Languages, 37, pp 132-139.
Ootsuka (1990) On Chinese-Japanese Homographs. Collection of Studies in foreign language education, 12, 327-337.
Institute of Linguistics CASS (2005) The Contemporary Chinese Dictionary 5th ed., Beijing: The Commercial Press.
Kindaichi, Ikeda (1988) Gakken Japanese Dictionary, 2nd ed, Tokyo: Gakken.
Lv Sh. X. (1980) Modern Chinese 800 Words, Beijing: The Commercial Press.
Beijing Center for Japanese Studies (2003) Chinese-Japanese Parallel Corpus, Beijing: Beijing Center for Japanese Studies. (Fruition of the Project: The Development and Application of Chinese-Japanese Parallel Corpus, No. 99BYY007).
Shi J. J, Qiao Y. (2016) Measuring the Semantic Distance of Chinese-Japanese Homographs: Approached from the F-measure Constructed on the Basis of Translation Ratio. Journal of PLA University of Foreign Languages, 39, pp 76-84+160.