Ia-1 NLP
Ia-1 NLP
Ia-1 NLP
a NLP:
It stands for natural language processing, which is a part of computer science,
artificial intelligence and human language.
It is a branch of AI that deals with interaction with human language and computers.
Ex: Chatbots, language translation etc.
Types:
1. NLU
2. NLG
In this diagram:
"S" represents the sentence structure, consisting of a subject (NP) and a predicate
(VP).
The subject "The cat" (NP) contains an article ("The") and a noun ("cat").
The predicate "caught the mouse" (VP) consists of a verb ("caught") and an object
(NP), which includes an article ("the") and a noun ("mouse").
The morphophonemic rule applied to the verb "caught" is represented within the
structure, indicating the pronunciation change from the base form "catch" to the past
tense form "caught" due to the addition of the past tense suffix "-ed".
3. Morphological Parsing:
Morphological parsing involves breaking down words into their constituent
morphemes, which are the smallest units of meaning or grammatical function.
Rules:
Lexicon:
The lexicon is a repository of morphemes and their corresponding meanings or
grammatical functions.
It gives the information about root word and it’s affix.
Morphotactic Rules:
Morphotactic rules govern the allowable combinations and order of morphemes
within a word. They specify the valid morphological structures in a language.
Orthographic Rules:
Orthographic rules define the correct spelling conventions for morphemes and
words within a language. They ensure consistency in written representations.
Word: "unhappiness"
Morphological Parsing:
Lexicon:
"un-" (prefix): negation or reversal
"happy" (stem): feeling or emotion
"-ness" (suffix): state or quality
Morphotactic Rules:
The prefix "un-" can attach to the beginning of a word to denote negation.
The suffix "-ness" typically attaches to adjectives to form abstract nouns indicating
a state or quality.
Orthographic Rules:
Orthographic rules dictate the spelling conventions, ensuring that the prefix, stem,
and suffix are spelled correctly according to standard conventions.
7.
8.