About Talk Corpus

Watch the intro video!


Many language teachers are already familiar with TED Talks - a wide range of freely available video presentations given by expert speakers on a variety of topics. However, it is not a simple task to find a TED Talk of an appropriate speed or linguistic level for English language learners of a certain level, or to develop supplementary data, such as word lists, which are necessary to assist in the teaching of the language used in the talks. Talk Corpus is a web-based corpus that helps to solve some of these problems. Talk Corpus is comprised of 2,051 TED Talk videos, related meta-data, and supplementary linguistic data. For more information, read the associated research paper.

TalkCorpus.com was designed and developed by Paul Raine.

TED Talk videos, transcripts, and associated meta-data remains property of TED, and is used in accordance with the applicable Creative Commons license.

Search the corpus
Browse the corpus
TitleFKRE WPM LengthNAWL NGSL

Talk details