Repository CLARIN-D Centre Leipzig


The CLARIN-D repository at the University of Leipzig offers long-term preservation of digital resources together with their descriptive metadata. Its mission is to keep resources available over the long term, to preserve knowledge gained in research, to aid the transfer of knowledge into new contexts, and to integrate new methods and resources into university curricula.

CLARIN-D is developing a digital infrastructure for language-centred research in the social sciences and humanities. The main function of the CLARIN-D service centres is to provide relevant, useful data and tools in an integrated, interoperable and scalable way. CLARIN-D will roll the infrastructure out in close collaboration with expert scholars in the humanities and social sciences, to ensure that it meets the needs of users in a systematic and easily accessible way. Integration of the repository into the national CLARIN-D and international CLARIN infrastructures gives it wide exposure, increasing the likelihood that the resources will be used and further developed beyond the lifetime of the projects in which they were developed.

Among the resources currently available in the Leipzig repository are corpora of the Leipzig Corpora Collection / Project 'Deutscher Wortschatz', based on newspaper, Wikipedia and Web text. In addition, several REST-based web services are provided for a variety of NLP tasks.


The CLARIN-D centre Leipzig offers the hosting of research data in its repository. Learn more about how to deposit your research data with us.

To the deposit page

Get an overview of our previous and current partners and collaborations.

To the partners list


The CLARIN corpora portal of the Wortschatz project offers access to all corpora of the Leipzig Corpora Collection (LCC) that we have already integrated into the CLARIN infrastructure.

To the LCC's CLARIN corpora portal

On the CURL portal you can contribute to corpus collection for under-resourced languages simply by entering a URL. We appreciate your support!

To the CURL portal

The ASV Toolbox is a modular collection of tools for the exploration of written language data.

To the online toolbox

Some of our data are available in RDF. The portal contains lexical and morphological data with their English translations and links to WordNet RDF.

To the RDF portal

Our tool for corpus comparison depicts the differences in vocabulary use of different textual resources.

Compare corpora

The Null Result Portal aims at making null results available to the scientific community.

Null results

Get an overview of the repository's resources as indexed in the Virtual Language Observatory.

To the Virtual Language Observatory

Certification Details

This repository has been awarded the CoreTrustSeal.
This repository has been certified as a CLARIN Centre Type B.

Repository Details

Name: CLARIN-D Resource Centre Leipzig
Repository: Fedora Repository
Search in repository: Fedora Repository Search
Virtual Language Observatory: Search in the VLO
Corpus portal: Main page

Organization Details

Name: NLP Group, Department of Computer Science, University of Leipzig
Phone number: +49-(0)341-9732230
Privacy notice:
City, State, Country: Leipzig, Saxony, Germany
Postal address: Universität Leipzig; Institut für Informatik; PF 100920; 04009 Leipzig; Germany