SSGL: a semi-supervised grammar learner
Online publication date: Sat, 07-Aug-2010
by K. Sundarakantham, N. Sheena, S. Mercy Shalinie
International Journal of Computer Applications in Technology (IJCAT), Vol. 38, No. 4, 2010
Abstract: Grammatical inference, also known as grammar induction, is the problem of learning structural models from data. For decades, researchers have tried to devise formal, detailed grammars that capture the observed regularities of language. This paper presents a comprehensive solution for efficient language acquisition: a novel semi-supervised algorithm that learns a streamlined representation of linguistic structures from a plain natural-language corpus. The input datasets are the ATIS dataset and sentences from children's literature. The proposed algorithm generates rules from the given corpora, and new sentences are then generated from the learned rules. The algorithm's performance is evaluated on two measures: recall and precision. Recall was 0.935 and precision was 0.916, better than the results of other algorithms such as EMILE, ADIOS and GCS. The running time of the algorithm was tested by varying the size of the dataset and was found to increase linearly with dataset size.
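The abstract does not say how recall and precision were computed. As a rough illustration only, the sketch below uses a convention common in grammar-induction evaluations: precision is the fraction of sentences generated by the learned grammar that a reference judge accepts, and recall is the fraction of held-out corpus sentences that the learned grammar accepts. The function name, the two acceptance predicates, and the toy sentences are all hypothetical and are not taken from the paper.

```python
from typing import Callable, Iterable, Tuple


def precision_recall(
    generated: Iterable[str],
    held_out: Iterable[str],
    is_grammatical: Callable[[str], bool],
    learned_accepts: Callable[[str], bool],
) -> Tuple[float, float]:
    """Assumed definitions, not confirmed by the paper:
    precision = share of generated sentences judged grammatical,
    recall    = share of held-out sentences the learned grammar accepts.
    """
    generated = list(generated)
    held_out = list(held_out)
    precision = (
        sum(is_grammatical(s) for s in generated) / len(generated)
        if generated else 0.0
    )
    recall = (
        sum(learned_accepts(s) for s in held_out) / len(held_out)
        if held_out else 0.0
    )
    return precision, recall


# Toy stand-ins: finite sets play the role of the reference language
# and of the language defined by the learned rules.
reference_language = {"the dog runs", "the cat runs", "a dog sleeps", "a cat sleeps"}
learned_language = {"the dog runs", "the cat runs", "a dog sleeps", "dog the runs"}

generated_sentences = sorted(learned_language)       # sentences produced from learned rules
held_out_sentences = ["the dog runs", "a cat sleeps"]  # unseen corpus sentences

precision, recall = precision_recall(
    generated_sentences,
    held_out_sentences,
    is_grammatical=lambda s: s in reference_language,
    learned_accepts=lambda s: s in learned_language,
)
print(f"precision={precision:.3f} recall={recall:.3f}")
```

In a real evaluation the set-membership predicates would be replaced by a parser for the learned rules and by human or reference-grammar judgements.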