UHH
>
Informatik
>
NatS
>
ProSub16 Web
>
TranscriptionGuidelines
(21 Apr 2016,
KoljaKirsch
)
P
rint version
Transkription (/informatik/isr/nats/projects/lecture2go/done)
"filled pauses": ähm, äh, hm
Numbers: spell in full (ie nineteenhundredtwentyseven / neunzehnhundertsiebenundzwanzig [<- we did this] / neunzehn hundert siebenundzwanzig /...)
(might not be consistent)
Single letters: precede with $ (SE-3 --> $S $E drei)
Hesitations, discontinued words: mark with a dash: Logikprogramie-
Else no dash before a space! (Sprach- und Textverarbeitung -> Sprach und Textverarbeitung)
Punctuation: doesn't really matter
No upper-case letters at the start of a sentence, just words that should start with upper-case (we don't know where a sentence starts anyway)
foreign language stuff: doesn't matter
non-language stuff: put in [...] (ie [lachen] / [räuspern] / ...)
Questions from the audience: write as everything else
Maybe do this: [Q:] and [A:] before question or answer (didn't do that)
Might be too quiet to understand -> [Publikum]
Elision (Verschleifungen): write
hamwa
as
haben wir
, but if its close to lexical
hab ich
instead of
habe ich
write it as hab ich
There might be a "notizen" or "notes" file with additional information about the speaker or transcription in the lecture folder
--
KoljaKirsch
- 21 Apr 2016
ProSub16
Log In
Navigation
Research
Teaching
Publications
Download
People
Jobs
Contact
Changes
Courses
-
CIT 831 Natural Language Processing
More
Research
Constraint Dependency Grammar
Hamburg Dependency Treebank
Spoken Wikipedia Corpus
Construction Grammar
NatsWiki
Main
User
Sandbox
System
Copyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki?
Send feedback