10.6084/M9.FIGSHARE.3471839
Xavier Bost
0000-0002-5624-8721
Vincent Labatut
0000-0002-2619-2835
Georges Linarès
Serial Speakers: a Dataset of TV Series
<div><b>Dataset of three TV Series</b> with <b>manual</b> annotations.</div><div><br></div><div>Cite as:</div><div>@inproceedings{Bost2020,<br></div><div> title = {Serial Speakers: a Dataset of TV Series},<br></div><div> author = {Bost, Xavier and Labatut, Vincent and Linares, Georges},<br></div><div> url = {https://hal.archives-ouvertes.fr/hal-02477736},<br></div><div> booktitle = {12th International Conference on Language Resources and Evaluation (LREC 2020)},<br></div><div> address = {Marseille, France},<br></div><div> year = {2020}</div><div>}
</div><div><pre><br>The dataset consists of 3 TV series: <br></pre></div><br>- <i>Breaking Bad</i>: S01--S05 (file 'bb.json')<br>- <i>Game of Thrones</i>: S01--08 (file 'got.json')<br>- <i>House of Cards</i>: S01--S02 (file 'hoc.json')<br><br>All three files are in .json format and contain TV Series annotated data.<br><br>Each TV Series is defined by its <b>name</b>,<br><br>A TV Series contains <b>seasons</b>, defined by their <b>id</b>s.<br><br>Every season is made of <b>episodes</b>, defined by their <b><b>id</b></b>s,<b> title</b>s, <b>duration </b>and<b> fps </b>.<br><br>Each episode contains two basic kinds of <b>data</b>: <b>scenes</b> and <b>speech segments</b>.<br><br>Scenes are defined by <b>start</b>ing points and are made of <b>shots </b>(Seasons 1 only)<b>.<br><br></b>A shot is defined by<b>:<br><br></b><div>- <b>Start</b>ing and <b>end</b>ing positions.</div><div>-<b> </b>Recurring shot <b>id</b>s.</div><br><div>The speech segments are defined by their:</div><div><br></div>- <b>Start</b>ing and <b>end</b>ing points.<br>- <b>Text</b>ual content (here encrypted for copyright reasons).<br>- <b>Speaker</b>.<br>- Possible<b> interlocutors</b> (for the following episodes only: <b>bb</b>: S01E04, S01E06, S02E03, S02E04; <b>got</b>: S01E03, S01E07, S01E08; <b>hoc</b>: S01E01, S01E07, S01E11).<br><br><div>All timestamps are expressed in seconds and are valid for the video files extracted from the commercial DVDs (PAL 25 FPS), with recaps (unannotated) included at the beginning of the <i>House of Cards</i> episodes.</div><div><br></div><div>In you are interested in the textual content of the dataset, please consider using our text recovering tool on GitHub:</div><div><br></div><div>https://github.com/bostxavier/Serial-Speakers</div><div><br></div><div>A comprehensive description of the dataset can be found at:</div><div><br></div><div>https://hal.archives-ouvertes.fr/hal-02477736<br></div><b> </b><br>
80305 Multimedia Programming
figshare
2020
2016-07-06
2020-02-17
Dataset
47061256 Bytes
CC BY 4.0