10.24416/UU01-AR6LLU
Suijkerbuijk, Michelle
Michelle
Suijkerbuijk
https://orcid.org/0000-0002-1106-7580
Corpus of Dutch tweets containing kinship terms
Utrecht University
2022
Research Data
Humanities - Languages and literature (6.2)
definite articles
possessive pronouns
kinship
kinship terminology
corpus linguistics
Leufkens, Sterre
https://orcid.org/0000-0002-2251-1309
van der Meulen, Marten
https://orcid.org/0000-0002-2644-8319
2022-08-24T07:38:33.000000
2021-05-01/2021-12-31
en-us
1
Open - freely retrievable
Creative Commons Attribution 4.0 International Public License
A corpus of 2400 tweets, collected for research into marking of Dutch kinship terms. We searched Twitter for 24 Dutch kinship terms and selected the first 100 positive hits. A hit was considered positive when it included the kinship term, and the term was pre--modified by 1) a possessive pronoun 2) a definite article or 3) a zero-marker. It was excluded when the kinship term was used with a different meaning and when there was a post-modifier. All tweets are provided in .txt format.
Along with the corpus itself we include an Excel with all annotations, i.e. classification of the kinship terms, pre-modifiers, and data on the gender of authors, etc.