Publication Type : Conference Paper
Publisher : Proceedings of the 10th Annual Meeting of the Forum for Information Retrieval Evaluation, Association for Computing Machinery
Source : Proceedings of the 10th Annual Meeting of the Forum for Information Retrieval Evaluation, Association for Computing Machinery, New York, NY, USA (2018)
Url : https://doi.org/10.1145/3293339.3293342
ISBN : 9781450362085
Keywords : Author Profiling, FIRE 2018, Indian languages, Native Language Identification
Campus : Coimbatore
School : School of Engineering
Center : Computational Engineering and Networking
Department : Electronics and Communication
Year : 2018
Abstract : The growth of digital platforms enables the industries to serve user specific services. Most of the time, the information of the internet users are not explicitly available and it acts as a constrain in developing the personalized applications. There comes the need for author profiling tasks, which intends to predict the internet users characteristics from their texts. Native language Identification is one among the author profiling task, that predicts the authors native language from their texts available in other language. We have proposed Indian Native Language Identification task, where the internet users texts are written in English and participants needs to find, whether the user's native language is from Tamil, Malayalam, Kannada, Telugu, Bengali and Hindi. The corpus is collected from texts from regional news paper pages available in Facebook by considering the hypothesis that the user belongs to a particular region will read the news from respective regional news paper.
Cite this Research Publication : A. Kumar M, Hb, B. Ganesh, Dr. Soman K. P., and SG, A., “Indian Native Language Identification - INLI 2018”, in Proceedings of the 10th Annual Meeting of the Forum for Information Retrieval Evaluation, New York, NY, USA, 2018.