Corpora (English Textbooks in Turkey and English University Entrance Exams)

Several years ago, we wanted to investigate if English textbooks and university entrance exams in Turkey had any disconnect, if at all. This was partially because of curiosity, as well as the consensus (in Turkey) that the textbooks are too easy for the exams. So we wanted to test it. Both corpora were born out of a collaboration of roughly a year with Dr. Xiaoli Yu, Yağmur Su Kolsal, Beyza Dilbaz, and Halide İslamoğlu. 

First, we created the high school textbooks corpus between 2019 and 2020. Then, I expanded the corpus all the way down to grade 5. The corpus is freely available to all shortly and is in raw .txt format, which means it is not processed. However, when used in projects/studies/publications, please cite the following two studies: 

Gedik, Tan Arda. 2021. An Analysis of Lexicogrammatical Development in English Textbooks in Turkey: A Usage-Based Construction Grammar Approach. Explorations in English Language and Linguistics 9(1).

Gedik, Tan Arda. & Yağmur Kolsal S. 2022. A Corpus Based Approach to English University Entrance Exams and English High School Textbooks in Turkey. TAPSLA 8(1). 157–176. https://doi.org/10.31261/TAPSLA.8431.

The corpus mainly consists of student’s books, workbooks (when available), and listening tracks (when available). Please read the documentation to find out more about the corpus. 

We hope that this corpus will inspire future studies on textbooks.