UPSI Digital Repository (UDRep)
Start | FAQ | About
Menu Icon


Browse by: Year_icon Subject Year_icon Publisher Year_icon Year
Total records found : 1
Simplified search suggestions : Siti Nordianah Hai Hom
12014
Article
An automatic bilingual corpora generator
Siti Nordianah Hai Hom
Bilingual corpora that contains similar documents of two different languages are examples of essential resources for Natural Language Processing (NLP) tasks including Cross-Lingual Information Retrieval (CLIR) and machine translation. Nevertheless, these resources could also be useful for many processes in learning languages. We introduce an automatic bilingual corpora generator that builds corpus resources from the web. This generator involves the use of the in-domain terms (IDT), in which the terms can be thought of as the most important contextually relevant words. The method used is simple yet practical, and makes acquiring resources from web sources more than just collecting texts and pasting them all together. However, as an on-going project, the system has not been fully implemented and evaluated. In this paper, the researchers emphasizes more on the prototype of the system in terms of appearance and display. For example, the generator shall be built on a webbased system that gi.....

7 hits

Filter
Loading results...



Specific Period
Loading results...



Top 5 related keywords (beta)

Loading results...



Recently Access Item




Installed and configured by Bahagian Automasi, Perpustakaan Tuanku Bainun, Universiti Pendidikan Sultan Idris
If you have enquiries, kindly contact us at pustakasys@upsi.edu.my or 016-3630263. Office hours only.