Publication Type : Journal Article
Source : 2022 OITS International Conference on Information Technology (OCIT)
Url : https://ieeexplore.ieee.org/abstract/document/10053800
Campus : Amritapuri
School : School of Computing
Center : Computational Linguistics and Indic Studies
Verified : No
Year : 2022
Abstract : In this work, we propose a semi-supervised boot-strapping approach for relation extraction in domain specific texts, specifically focusing on agricultural domain. Our approach utilizes the BERT model with dependency parsing for relation extraction. The proposed model, focuses on identifying five inter subdomain relations viz., Soil_Location, Soil_Crop, Disease_Pathogen, Pathogen_Crop, and Chemical_Crop. We created a corpus of 30,000 sentences extracted from recognised agriculture sites to evaluate the model. The labeled relations were then manually checked to evaluate the prediction accuracy. We used a test corpus with 700 sentences that included 3500 triplets for the evaluation. The proposed approach presents an average macro F -Score of 86.4 %, which is quite promising for semi-supervised domain specific relation extraction systems. Experimental results show the efficacy of the proposed approach in classifying relational phrases in a semi-supervised set-up for the agricultural domain.
Cite this Research Publication : Veena, G., Gupta, D., Kanjirangat, V. "Semi Supervised Approach for Relation Extraction in Agriculture Documents" 2022 OITS International Conference on Information Technology (OCIT), 2022