This paper seeks to advance computational work on the Yorùbá language by designing a rule-based parts of speech tagger for it. The work adopts Standard Theory and Principles and Parameters Theory to segment the language and to instruct the computer system, through Prolog, on its syntactic structure. About a hundred Yorùbá words are coded to serve as the lexicon, or dictionary.
Syntactic rules built on these words are also programmed. The work tags the parts of speech of non-derivative Yorùbá sentences. It reveals that not all Yorùbá NPs can complement prepositions in prepositional phrases (PPs). It also shows that Yorùbá words need to be reclassified so that machines such as computers can generate grammatically acceptable Yorùbá sentences.
Table of Contents
- Introduction
- The Yorùbá Language
- Literature Review
Objectives and Key Themes
This paper aims to advance computational work on the Yorùbá language by developing a rule-based parts of speech tagger. The study employs Standard Theory and Principles and Parameters Theory to segment the language and, through Prolog, to instruct the computer system on its syntactic structure. Its key themes are:
- Computational advancement of the Yorùbá language
- Rule-based parts of speech tagging
- Syntactic structure of Yorùbá
- Lexicon development for Yorùbá
- Analysis of Yorùbá prepositional phrases
Chapter Summaries
- Introduction: This section defines the concept of parts of speech tagging (POS tagging) and its significance in natural language processing (NLP). It discusses the different types of POS taggers, including rule-based and stochastic models.
- The Yorùbá Language: This chapter provides background information on the Yorùbá language, its geographical distribution, and its dialectal variations.
- Literature Review: This chapter reviews existing efforts to computerize the Yorùbá language, highlighting previous works on Yorùbá POS tagging and their limitations. It emphasizes the need for a comprehensive and linguistically sound approach to Yorùbá POS tagging.
Keywords
The main keywords and focus topics of this paper are: parts of speech tagging, rule-based parts of speech tagging, Yorùbá language, computational linguistics, natural language processing, syntactic analysis, lexicon, and prepositional phrases.
Frequently Asked Questions
What is the main goal of the paper on Yorùbá POS tagging?
The paper aims to computationally advance the Yorùbá language by designing a rule-based parts of speech (POS) tagger for simple sentences.
Which programming language was used for this project?
The computer system was instructed on the syntactic structure of Yorùbá through Prolog.
What linguistic theories are adopted in the study?
The work adopts Standard Theory and Principles and Parameters Theory to analyze the syntactic structure of the language.
What did the research find regarding Yorùbá prepositions?
It revealed that not all Yorùbá Noun Phrases (NPs) can complement prepositions in prepositional phrases (PPs).
Why is a lexicon necessary for the POS tagger?
A coded lexicon of approximately one hundred Yorùbá words serves as a dictionary to help the system identify and tag word categories correctly.
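As a rough illustration of how a lexicon and a few syntactic rules can drive tagging, the sketch below uses Python rather than the paper's Prolog. The words, tags, and sentence patterns are assumptions chosen for the example; they are not taken from the paper's lexicon or rules, and the function names `nfc`, `tag`, and `is_accepted` are hypothetical.

```python
import unicodedata


def nfc(text):
    """Normalize to NFC so composed and decomposed diacritics compare equal."""
    return unicodedata.normalize("NFC", text)


# Illustrative lexicon (assumed entries, not the paper's): word -> tag.
_RAW_LEXICON = {
    "Olú": "N",  # a personal name
    "ìwé": "N",  # 'book'
    "ọjà": "N",  # 'market'
    "ra": "V",   # 'buy'
    "lọ": "V",   # 'go'
    "sí": "P",   # 'to'
}
LEXICON = {nfc(word): pos for word, pos in _RAW_LEXICON.items()}

# Hypothetical tag sequences accepted as simple sentences.
SENTENCE_PATTERNS = {
    ("N", "V"),            # subject + verb
    ("N", "V", "N"),       # subject + verb + object
    ("N", "V", "P", "N"),  # subject + verb + prepositional phrase
}


def tag(sentence):
    """Return (word, tag) pairs; words missing from the lexicon get 'UNK'."""
    return [(word, LEXICON.get(word, "UNK")) for word in nfc(sentence).split()]


def is_accepted(sentence):
    """Check whether the sentence's tag sequence matches an allowed pattern."""
    tags = tuple(pos for _, pos in tag(sentence))
    return tags in SENTENCE_PATTERNS


if __name__ == "__main__":
    print(tag("Olú ra ìwé"))
    print(is_accepted("Olú lọ sí ọjà"))
```

A Prolog implementation would more naturally express the lexicon as facts and the patterns as grammar rules; the flat pattern set here is only the simplest stand-in for that idea.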
What is the significance of this work for natural language processing?
It offers a starting point for computational work on Yorùbá and shows that Yorùbá words need to be reclassified before computers can generate grammatically acceptable Yorùbá sentences.
- Abiola Oyelere (Author), 2013, Rule Based Parts of Speech Tagging of Yorùbá Simple Sentences, Munich, GRIN Verlag, https://www.grin.com/document/347022