@Anne,  Andy. Great work indeed. I also found reichtagsprotokolle.de etc.
and thought I would need to collect the corpora myself, but it is
unnecessary now. Thanks!

What I'm wondering though if you also augmented the corpora with additional
tags such as speakers and what political parties they belonged to as I need
them for my semantic analysis.

--
Alexander Osherenko, Dr. rer. nat.
Socioware Development <http://www.socioware.de/osherenko_page.html>
Founder and R&D
LMU Project
<https://www.researchgate.net/project/Researching-radicalization-and-genocide>
Profile: ResearchGate
<https://www.researchgate.net/profile/Alexander_Osherenko>
Profile: Humboldt-Universität zu Berlin
<https://wirsindhumboldt.de/de/VKkZNyFaeu>
Channel: Youtube <https://www.youtube.com/user/MrOsherenko>


Am Fr., 9. Sept. 2022 um 15:16 Uhr schrieb Andy Lücking <
[email protected]>:

> Hi Alexander,
>
> Giuseppe Abrami and others of my colleagues at the Text Technology Lab
> in Frankfurt have collected a large corpus of German-language
> parliamentary debates at the national and federal levels. This
> includes parliamentary debates from Germany (since 1867), from
> Austria, Switzerland, and Liechtenstein. For Germany, debates from
> regional parliaments (where available) are also included. The
> German-language debates at the national level de facto also include
> the debates from the DeuParl period. The entire corpus is annotated
> with spaCy and is available in UIMA. At the same time, each document
> includes the session date and title in the meta-data (see
> http://www.lrec-conf.org/proceedings/lrec2022/pdf/2022.lrec-1.202.pdf).
> You
> can find the requested temporal sections on the corpus website:
> https://github.com/texttechnologylab/GerParCor
>
> Best,
>
> Andy
>
>
> Zitat von Anne Lauscher <[email protected]>:
>
> > Hi Alexander,
> >
> > There is a corresponding portion in our DeuParl corpus [1], which
> > contains speeches held in the German Reichstag and Bundestag.
> > The corresponding paper is this one: [2].
> >
> > Cheers
> > Anne
> >
> > [1]
> > https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2889?show=full
> > <https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2889?show=full>
> > [2] https://arxiv.org/pdf/2108.06295.pdf
> > ————
> > Dr. Anne Lauscher (she/ her)
> > Postdoctoral Researcher in Natural Language Processing
> > MilaNLP/ Data and Marketing Insights Unit
> > Bocconi University
> > Via Roentgen 1-2, 20136 Milan, MI, Italy
> > Website: https://anne-lauscher.de
> > Twitter: @anne_lauscher
> >
> >> On 9 Sep 2022, at 11:24, Alexander Osherenko <[email protected]> wrote:
> >>
> >> Hi all,
> >>
> >> I am looking for a historical corpus containing political speeches
> >> of the Weimar Republic in Germany (1919-1932).
> >>
> >> Best, Alexander
> >>
> >> --
> >> Alexander Osherenko, Dr. rer. nat.
> >> Socioware Development <http://www.socioware.de/osherenko_page.html>
> >> Founder and R&D
> >> LMU Project
> >> <
> https://www.researchgate.net/project/Researching-radicalization-and-genocide
> >
> >> Profile: ResearchGate
> >> <https://www.researchgate.net/profile/Alexander_Osherenko>
> >> Profile: Humboldt-Universität zu Berlin
> >> <https://wirsindhumboldt.de/de/VKkZNyFaeu>
> >> Channel: Youtube <https://www.youtube.com/user/MrOsherenko>
> >> _______________________________________________
> >> Corpora mailing list -- [email protected]
> >> https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
> >> To unsubscribe send an email to [email protected]
>
>
>
>
_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]

Reply via email to