| Peer-Reviewed

The First Step Towards Suffix Stripping of Mising Words Using YASS

Received: 17 January 2016     Accepted: 24 February 2016     Published: 21 March 2016
Views:       Downloads:
Abstract

The authors used yet another suffix stripper (YASS) to find out the base words or stems for one of the languages of north-east India called Mising Language. There are over 5, 00,000 speakers in Mising Language. The Roman scripts are used for Mising Language. Mising Agom Kébang is the highest body of the Mising people and is dedicated for the development of Mising literature. The particular suffix remover may be used without in depth knowledge about the language. The authors successfully used the YASS with a F-score of around 87% for finding the stem. In the field of information retrieval, the automatic removals of suffixes are very important. As the mising language does not have a known corpus, the authors created the corpus.

Published in International Journal of Language and Linguistics (Volume 4, Issue 2)
DOI 10.11648/j.ijll.20160402.15
Page(s) 74-79
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2016. Published by Science Publishing Group

Keywords

Text Mining, Information Retrieval, Suffix Removal, Mising Language, YASS

References
[1] Abhijit Paul, Arindam Dey and Bipul Syam Purkayastha, 2014, An Affix Removal Stemmer for Natural Language Text in Nepali, International Journal of Computer Applications, Vol 91(6)
[2] A. G Jivani., 2011. A comparative Study of Stemming Algoritms, Int. J. Comp. Tech. Appl., Vol 2(6)
[3] B. R Prasad., 1991. Mising Grammar, Central Institute of Indian Languages
[4] Dalwadi Bijal and Suthar Sanket, 2014, Overview of Stemming Algorithms for Indian and Non-Indian Languages, International Journal of Computer Science and Information Technologies, Vol. 5(2)
[5] E. A. Gait,. 1905. A History of Assam. Calcutta: Thacker, Spink & Co
[6] Navanath Saharia, U Sharma, J Kalita, 2012, Analysis and evaluation of stemming algorithms: a case study with Assamese, Proceedings of the International Conference on Advances in Computing, Communications and Informatics
[7] Padmaja Sharma, U. Sharma, J. Kalita, 2012, Suffix stripping based NER in Assamese for location names, 2nd National Computational Intelligence and Signal Processing (CISP)
[8] Paice, 1990, Another Stemmer, ACM SIGIR Forum, Vol 24 (3)
[9] Prasenjit Majumder, Mandar Mitra, Swapan k. Parui, Gobinda Kole, Pabitra Mitra and Kalyankumar Datta, 2007, Yass: yet another suffix stripper, ACM Transactions on Information Systems, Volume 25, Issue 4
[10] Reinaldo Viana Alvares, Ana Cristina Bicharra Garcia, Inhaúma Ferraz, 2005, STEMBR: A Stemming Algorithm for the Brazilian Portuguese Language, Progress in Artificial Intelligence, Volume 3808 of the series Lecture Notes in Computer Science
[11] T. Taid, 1987, Linguistics of the Tibeto-Burman Area, Volume 10.1
[12] T. Taid, 2010, A dictionary of the Mising language: with an introduction to Mising phonology and grammar
Cite This Article
  • APA Style

    Sadiq Hussain, Rizwan Rehman, G. C. Hazarika, J. J. Kuli. (2016). The First Step Towards Suffix Stripping of Mising Words Using YASS. International Journal of Language and Linguistics, 4(2), 74-79. https://doi.org/10.11648/j.ijll.20160402.15

    Copy | Download

    ACS Style

    Sadiq Hussain; Rizwan Rehman; G. C. Hazarika; J. J. Kuli. The First Step Towards Suffix Stripping of Mising Words Using YASS. Int. J. Lang. Linguist. 2016, 4(2), 74-79. doi: 10.11648/j.ijll.20160402.15

    Copy | Download

    AMA Style

    Sadiq Hussain, Rizwan Rehman, G. C. Hazarika, J. J. Kuli. The First Step Towards Suffix Stripping of Mising Words Using YASS. Int J Lang Linguist. 2016;4(2):74-79. doi: 10.11648/j.ijll.20160402.15

    Copy | Download

  • @article{10.11648/j.ijll.20160402.15,
      author = {Sadiq Hussain and Rizwan Rehman and G. C. Hazarika and J. J. Kuli},
      title = {The First Step Towards Suffix Stripping of Mising Words Using YASS},
      journal = {International Journal of Language and Linguistics},
      volume = {4},
      number = {2},
      pages = {74-79},
      doi = {10.11648/j.ijll.20160402.15},
      url = {https://doi.org/10.11648/j.ijll.20160402.15},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijll.20160402.15},
      abstract = {The authors used yet another suffix stripper (YASS) to find out the base words or stems for one of the languages of north-east India called Mising Language. There are over 5, 00,000 speakers in Mising Language. The Roman scripts are used for Mising Language. Mising Agom Kébang is the highest body of the Mising people and is dedicated for the development of Mising literature. The particular suffix remover may be used without in depth knowledge about the language. The authors successfully used the YASS with a F-score of around 87% for finding the stem. In the field of information retrieval, the automatic removals of suffixes are very important. As the mising language does not have a known corpus, the authors created the corpus.},
     year = {2016}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - The First Step Towards Suffix Stripping of Mising Words Using YASS
    AU  - Sadiq Hussain
    AU  - Rizwan Rehman
    AU  - G. C. Hazarika
    AU  - J. J. Kuli
    Y1  - 2016/03/21
    PY  - 2016
    N1  - https://doi.org/10.11648/j.ijll.20160402.15
    DO  - 10.11648/j.ijll.20160402.15
    T2  - International Journal of Language and Linguistics
    JF  - International Journal of Language and Linguistics
    JO  - International Journal of Language and Linguistics
    SP  - 74
    EP  - 79
    PB  - Science Publishing Group
    SN  - 2330-0221
    UR  - https://doi.org/10.11648/j.ijll.20160402.15
    AB  - The authors used yet another suffix stripper (YASS) to find out the base words or stems for one of the languages of north-east India called Mising Language. There are over 5, 00,000 speakers in Mising Language. The Roman scripts are used for Mising Language. Mising Agom Kébang is the highest body of the Mising people and is dedicated for the development of Mising literature. The particular suffix remover may be used without in depth knowledge about the language. The authors successfully used the YASS with a F-score of around 87% for finding the stem. In the field of information retrieval, the automatic removals of suffixes are very important. As the mising language does not have a known corpus, the authors created the corpus.
    VL  - 4
    IS  - 2
    ER  - 

    Copy | Download

Author Information
  • Dibrugarh University, Dibrugarh, Assam, India

  • Centre for Computer Studies, Dibrugarh University, Dibrugarh, Assam, India

  • Department of Mathematics, Dibrugarh University, Dibrugarh, Assam, India

  • Department of Ophthalmology, Assam Medical College, Dibrugarh, Assam, India

  • Sections