Skip to Main Content

Digital Humanties: Corpora & Datasets

 

 

A photo of library with shelves of old books and wooden desks with turquoise lamps.

Image credit: iStock

 

 A drawing of two computers, a cell phone, and a tablet depicting various graphs.

Image credit: Getty Images

 

What are corpora and datasets?

A corpus and a dataset are similar in that they are both groups of information.  A corpus is a collection of written texts, while a dataset can be collection of any kind of data.  Corpora and datasets can be used by students to find new discoveries from the data included.  

Corpora and Datasets