Repository logo
UWU eRepository
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Yкраї́нська
  • Log In
    New user? Click here to register.Have you forgotten your password?
Repository logo

UWU eRepository

  • Communities & Collections
  • All of DSpace
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Yкраї́нська
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Browse by Author

Browsing by Author "Singh, T."

Now showing 1 - 1 of 1
Results Per Page
Sort Options
  • No Thumbnail Available
    Item
    Resource Creation for English-Maithili Machine Translation (EMMT) A Divergence Perspective
    (Uva Wellassa University of Sri Lanka, 2018) Nidhi, R.; Singh, T.
    Maithili is one of the 22 scheduled Indian languages with almost no language technology resource. Absence of basic tools in this language has affected resource creation. Since English is the dominant language, translation from it can help creating the required corpora for tools development in Maithili. The present work discusses efforts for Language Technology Resource (LTR) creation and divergence study for an EMMT system, which is a Statistical Machine Translation (SMT) system. Creating any SMT system requires sizeable parallel, aligned corpora for training and testing. Creating general-purpose source corpora for English language and creating translation equivalents with possible alignments requires time and effort. The paper focuses on the data collection methods, cleaning, the size and structure of the text corpora, alignment and parallelization strategies, training, testing and a study of divergence between the language pair.
Copyright©2023.Uva Wellassa University, Sri Lanka |Maintained by Library-UWU