In linguistics, a corpus plural corpora or text corpus is a huge and structured set of texts now usually electronically stored and processed. Multilingual corpora that have been specially formatted for side-by-side comparison are called aligned parallel corpora.