Shallow parsing - Misplaced Pages

(Redirected from Chunking (computational linguistics)) Analysis of a sentence which first identifies constituent parts of sentences

This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Shallow parsing" – news · newspapers · books · scholar · JSTOR (February 2016) (Learn how and when to remove this message)

Shallow parsing (also chunking or light parsing) is an analysis of a sentence which first identifies constituent parts of sentences (nouns, verbs, adjectives, etc.) and then links them to higher order units that have discrete grammatical meanings (noun groups or phrases, verb groups, etc.). While the most elementary chunking algorithms simply link constituent parts on the basis of elementary search patterns (e.g., as specified by regular expressions), approaches that use machine learning techniques (classifiers, topic modeling, etc.) can take contextual information into account and thus compose chunks in such a way that they better reflect the semantic relations between the basic constituents. That is, these more advanced methods get around the problem that combinations of elementary constituents can have different higher level meanings depending on the context of the sentence.

It is a technique widely used in natural language processing. It is similar to the concept of lexical analysis for computer languages. Under the name "shallow structure hypothesis", it is also used as an explanation for why second language learners often fail to parse complex sentences correctly.

References

Citations

Jurafsky, Daniel; Martin, James H. (2000). Speech and Language Processing. Singapore: Pearson Education Inc. pp. 577–586.
Clahsen, Felser, Harald, Claudia (2006). "Grammatical Processing in Language Learners". Applied Psycholinguistics. 27: 3–42. doi:10.1017/S0142716406060024. S2CID 15990215.{{cite journal}}: CS1 maint: multiple names: authors list (link)

Sources

"NP Chunking (State of the art)". Association for Computational Linguistics. Retrieved 2016-01-30.
Abney, Steven (1991). "Parsing By Chunks | Principle-Based Parsing" (PDF). www.vinartus.net. pp. 257–278.

External links

Apache OpenNLP OpenNLP includes a chunker.
GATE General Architecture for Text Engineering GATE includes a chunker.
NLTK chunking
Illinois Shallow Parser Shallow Parser Demo

Types and standards	Corpus linguistics Lexical resource Linguistic Linked Open Data Machine-readable dictionary Parallel text PropBank Semantic network Simple Knowledge Organization System Speech corpus Text corpus Thesaurus (information retrieval) Treebank Universal Dependencies
Data	BabelNet Bank of English DBpedia FrameNet Google Ngram Viewer UBY WordNet Wikidata

Automatic identification
and data capture

Topic model

Computer-assisted
reviewing

Natural language
user interface

This computational linguistics-related article is a stub. You can help Misplaced Pages by expanding it.

Categories:

References

Citations

Sources

External links

See also