|
Turkish Journal of Electrical Engineering & Computer Sciences
elektrik@tubitak.gov.tr |
Turkish Journal of Electrical Engineering & Computer SciencesAutomatic knowledge extraction for filling in biography forms from Turkish textsDepartment of Computer Engineering, Faculty of Engineering, Fatih University, Büyükçekmece, 34500, İstanbul-TURKEY e-mail: zorhan@fatih.edu.tr Abstract: This study presents a method for building an automatic knowledge extraction system for filling in biography forms from Turkish texts. Several biographies are analyzed in order to choose the set of biography categories to be studied. The fields of the biography form to be created are also defined based on this analysis. Information extraction techniques are used for implementation. A separate testing platform is designed to evaluate the accuracy of the extracted data. Results of the testing platform have shown this study to be a promising process to be further developed especially for creating forms in the Turkish language. Key Words: Information extraction, regular expression, biography form generation, natural language processing
Turk. J. Elec. Eng. & Comp. Sci., 19, (2011), 59-71. Full text: pdf Other articles published in the same issue: Turk. J. Elec. Eng. & Comp. Sci.,vol.19,iss.1. |