+A **Corpus** (plural: corpora) is a large, structured collection of [Language Data](/wiki/language-data), typically compiled for [Text Analysis](/wiki/text-analysis). It serves as a foundation both for studying patterns in communication and for training computational models.
+## See also
+- [Natural Language Processing](/wiki/natural-language-processing)
+- [Linguistics](/wiki/linguistics)
+- [Data Set](/wiki/data-set)
... 1 more lines