@Anne, Andy. Great work indeed. I also found reichtagsprotokolle.de etc. and thought I would need to collect the corpora myself, but it is unnecessary now. Thanks!
What I'm wondering though if you also augmented the corpora with additional tags such as speakers and what political parties they belonged to as I need them for my semantic analysis.
-- Alexander Osherenko, Dr. rer. nat. Socioware Development http://www.socioware.de/osherenko_page.html Founder and R&D LMU Project https://www.researchgate.net/project/Researching-radicalization-and-genocide Profile: ResearchGate https://www.researchgate.net/profile/Alexander_Osherenko Profile: Humboldt-Universität zu Berlin https://wirsindhumboldt.de/de/VKkZNyFaeu Channel: Youtube https://www.youtube.com/user/MrOsherenko
Am Fr., 9. Sept. 2022 um 15:16 Uhr schrieb Andy Lücking < luecking@em.uni-frankfurt.de>:
Hi Alexander,
Giuseppe Abrami and others of my colleagues at the Text Technology Lab in Frankfurt have collected a large corpus of German-language parliamentary debates at the national and federal levels. This includes parliamentary debates from Germany (since 1867), from Austria, Switzerland, and Liechtenstein. For Germany, debates from regional parliaments (where available) are also included. The German-language debates at the national level de facto also include the debates from the DeuParl period. The entire corpus is annotated with spaCy and is available in UIMA. At the same time, each document includes the session date and title in the meta-data (see http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.202.pdf). You can find the requested temporal sections on the corpus website: https://github.com/texttechnologylab/GerParCor
Best,
Andy
Zitat von Anne Lauscher anne-lauscher@web.de:
Hi Alexander,
There is a corresponding portion in our DeuParl corpus [1], which contains speeches held in the German Reichstag and Bundestag. The corresponding paper is this one: [2].
Cheers Anne
[1] https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2889?show=full https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2889?show=full [2] https://arxiv.org/pdf/2108.06295.pdf ———— Dr. Anne Lauscher (she/ her) Postdoctoral Researcher in Natural Language Processing MilaNLP/ Data and Marketing Insights Unit Bocconi University Via Roentgen 1-2, 20136 Milan, MI, Italy Website: https://anne-lauscher.de Twitter: @anne_lauscher
On 9 Sep 2022, at 11:24, Alexander Osherenko osherenko@gmx.de wrote:
Hi all,
I am looking for a historical corpus containing political speeches of the Weimar Republic in Germany (1919-1932).
Best, Alexander
-- Alexander Osherenko, Dr. rer. nat. Socioware Development http://www.socioware.de/osherenko_page.html Founder and R&D LMU Project <
https://www.researchgate.net/project/Researching-radicalization-and-genocide
Profile: ResearchGate https://www.researchgate.net/profile/Alexander_Osherenko Profile: Humboldt-Universität zu Berlin https://wirsindhumboldt.de/de/VKkZNyFaeu Channel: Youtube https://www.youtube.com/user/MrOsherenko _______________________________________________ Corpora mailing list -- corpora@list.elra.info https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/ To unsubscribe send an email to corpora-leave@list.elra.info