Using sign language corpora as bilingual corpora for data mining: Contrastive linguistics and computer-assisted annotation

Résultats de recherche: Contribution dans un livre/un catalogue/un rapport/dans les actes d'une conférenceArticle dans les actes d'une conférence/un colloque

77 Téléchargements (Pure)

Résumé

More and more sign languages nowadays are now documented by large scale digital corpora. But exploiting sign language (SL) corpus data remains subject to the time consuming and expensive manual task of annotating. In this paper, we present an ongoing research that aims at testing a new approach to better mine SL data. It relies on the methodology of corpus-based contrastive linguistics, exploiting SL corpora as bilingual corpora. We present and illustrate the main improvements we foresee in developing such an approach: downstream,
for the benefit of the linguistic description and the bilingual (signed - spoken) competence of teachers, learners and the users; and upstream, in order to enable the automatisation of the annotation process of sign language data. We also describe the methodology we are using to develop a concordancer able to turn SL corpora into searchable translation corpora, and to derive from it a tool support to annotation.
langue originaleAnglais
titreProceedings of the 7th workshop on the Representation and Processing of Sign Languages:Corpus Mining
Sous-titreLREC 2016
Pages159-166
Nombre de pages8
Etat de la publicationPublié - 2016
Evénement7th Workshop on the Representation and Processing of Sign Languages: Corpus Mining - Grand Hotel Bernardin Conference Center , Portoroz, Slovénie
Durée: 28 mai 201628 mai 2016

Série de publications

NomProceedings of the Workshop on the Representation and Processing of Sign Languages

Séminaire

Séminaire7th Workshop on the Representation and Processing of Sign Languages: Corpus Mining
Pays/TerritoireSlovénie
La villePortoroz
période28/05/1628/05/16

Empreinte digitale

Examiner les sujets de recherche de « Using sign language corpora as bilingual corpora for data mining: Contrastive linguistics and computer-assisted annotation ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation