- home
- Advanced Search
5 Research products, page 1 of 1
Loading
- Other research product . Other ORP type . 2014Open Access EnglishAuthors:Hogenaar, A.Th.; Witkamp, P.; Bruijne, M.C. de; Wijnant, Arnaud; Kvamme, Trond; Kvalheim, Vigdis; Recker, Astrid; Fihn, Johan; Berglund, Torbjörn; Jerlehag, Birger; +7 moreHogenaar, A.Th.; Witkamp, P.; Bruijne, M.C. de; Wijnant, Arnaud; Kvamme, Trond; Kvalheim, Vigdis; Recker, Astrid; Fihn, Johan; Berglund, Torbjörn; Jerlehag, Birger; Müller Gjesdal, Anje; Parra, Carla; Dione, Bamba; De Smedt, Koenraad; Engelhardt, Claudia; Ludwig, Jens; Lenkiewicz, Przemyslaw;Publisher: University of CopenhagenCountry: NetherlandsProject: EC | DASISH (283646)
This report was produced in the context of the project Data Service Infrastructure for the Social Sciences and Humanities (DASISH) work package 4.3 Convergence of Data Services. The goal has been to allow the selection and promotion of high-quality deposit services for researchers in the Social Sciences and Humanities (SSH) and to make suggestions for service improvements.
- Other research product . Other ORP type . 2011Open Access EnglishAuthors:Odijk, J.E.J.M.; Overkoepelend onderzoeksprogramma UiL-OTS; LS OZ Taal en spraaktechnologie;Odijk, J.E.J.M.; Overkoepelend onderzoeksprogramma UiL-OTS; LS OZ Taal en spraaktechnologie;Publisher: META-NETCountry: NetherlandsProject: EC | T4ME NET (249119)
- Other research product . Other ORP type . 2013Open AccessAuthors:Reinanda, R.; Odijk, D.; de Rijke, M.;Reinanda, R.; Odijk, D.; de Rijke, M.;Publisher: TAIA '13Country: NetherlandsProject: EC | PROMISE (258191), NWO | Semantic Search in E-Disc... (7999), NWO | Building Rich Links to En... (2300153702), NWO | SPuDisc: Searching Public... (2300176811), NWO | Modeling and Learning fro... (8686), EC | LIMOSINE (288024)
- Other research product . Other ORP type . 2014Open Access EnglishAuthors:L'Hours, Hervé; Offersgaard, Lene; Wittenberg, M.; Wloka, Bartholomäus;L'Hours, Hervé; Offersgaard, Lene; Wittenberg, M.; Wloka, Bartholomäus;Publisher: European CommissionCountry: NetherlandsProject: EC | DASISH (283646)
The aim of this task was to analyse and compare the different metadata strategies of CLARIN, DARIAH and CESSDA, and to identify possibilities of cross-fertilization to take profit from each other solutions where possible. To have a better understanding in which stages of the research lifecycle metadata comes to the fore, we looked at several research data lifecycles and business process models. However the current research data lifecycle models have the ‘static’ data object as basis, whereas metadata design , redesign, creation and management can continue to be ‘live’ issues within the research lifecycle. We therefore developed a metadata lifecycle based closely on familiar lifecycle models but extended to support the more dynamic metadata issues. To describe the metadata management of the different infrastructures we took a double approach. We looked on a more general level and outlined the policies and strategies regarding metadata of the three infrastructures. We evaluated these strategies on metadata qua lity issues with the Bruce and Hillmann criteria. On the other hand we looked with more detail how the work on metadata management is done by the individual data repositories. The infrastructures of CESSDA, CLARIN and DARIAH differ in visions, strategies and initiatives regarding metadata issues; similarly there is a difference in metadata management among the various repositories. Despite these differences, cross fertilisation by coordination on common lists of metadata elements, sharing of knowledge, and linking resources would leverage the overall metadata quality. Evaluation of the prototype of the joint CLARIN, DARIAH and CESSDA metadata portal endorses the opinion that more coordination is needed. Metadata quality must be discussed in relation to the activities for which they are used. We suggest that the infrastructures DARIAH and CLARIN prioritise future collaboration about standardisation efforts, which have already been in itialised in dialogue between the CLARIN Standards Committee and the DARIAH representatives. Similar initiatives could be established with CESSDA.
- Other research product . 2016Open Access EnglishAuthors:Otegi, Arantxa; Aranberri, Nora; Branco, António; Hajic, Jan; Neale, Steven; Osenova, Petya; Pereira, Rita; Popel, Martin; Silva, João; Simov, Kiril; +1 moreOtegi, Arantxa; Aranberri, Nora; Branco, António; Hajic, Jan; Neale, Steven; Osenova, Petya; Pereira, Rita; Popel, Martin; Silva, João; Simov, Kiril; Agirre, Eneko;
handle: 10451/33107
Publisher: European Language Resources AssociationCountry: PortugalProject: EC | QTLEAP (610516)This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference. The corpora comprise both the well-known Europarl corpus and a domain-specific question-answer troubleshooting corpus on the IT domain. English is common in all parallel corpora, with translations in five languages, namely, Basque, Bulgarian, Czech, Portuguese and Spanish. We describe the annotated corpora and the tools used for annotation, as well as annotation statistics for each language. These new resources are freely available and will help research on semantic processing for machine translation and cross-lingual transfer.
add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.
5 Research products, page 1 of 1
Loading
- Other research product . Other ORP type . 2014Open Access EnglishAuthors:Hogenaar, A.Th.; Witkamp, P.; Bruijne, M.C. de; Wijnant, Arnaud; Kvamme, Trond; Kvalheim, Vigdis; Recker, Astrid; Fihn, Johan; Berglund, Torbjörn; Jerlehag, Birger; +7 moreHogenaar, A.Th.; Witkamp, P.; Bruijne, M.C. de; Wijnant, Arnaud; Kvamme, Trond; Kvalheim, Vigdis; Recker, Astrid; Fihn, Johan; Berglund, Torbjörn; Jerlehag, Birger; Müller Gjesdal, Anje; Parra, Carla; Dione, Bamba; De Smedt, Koenraad; Engelhardt, Claudia; Ludwig, Jens; Lenkiewicz, Przemyslaw;Publisher: University of CopenhagenCountry: NetherlandsProject: EC | DASISH (283646)
This report was produced in the context of the project Data Service Infrastructure for the Social Sciences and Humanities (DASISH) work package 4.3 Convergence of Data Services. The goal has been to allow the selection and promotion of high-quality deposit services for researchers in the Social Sciences and Humanities (SSH) and to make suggestions for service improvements.
- Other research product . Other ORP type . 2011Open Access EnglishAuthors:Odijk, J.E.J.M.; Overkoepelend onderzoeksprogramma UiL-OTS; LS OZ Taal en spraaktechnologie;Odijk, J.E.J.M.; Overkoepelend onderzoeksprogramma UiL-OTS; LS OZ Taal en spraaktechnologie;Publisher: META-NETCountry: NetherlandsProject: EC | T4ME NET (249119)
- Other research product . Other ORP type . 2013Open AccessAuthors:Reinanda, R.; Odijk, D.; de Rijke, M.;Reinanda, R.; Odijk, D.; de Rijke, M.;Publisher: TAIA '13Country: NetherlandsProject: EC | PROMISE (258191), NWO | Semantic Search in E-Disc... (7999), NWO | Building Rich Links to En... (2300153702), NWO | SPuDisc: Searching Public... (2300176811), NWO | Modeling and Learning fro... (8686), EC | LIMOSINE (288024)
- Other research product . Other ORP type . 2014Open Access EnglishAuthors:L'Hours, Hervé; Offersgaard, Lene; Wittenberg, M.; Wloka, Bartholomäus;L'Hours, Hervé; Offersgaard, Lene; Wittenberg, M.; Wloka, Bartholomäus;Publisher: European CommissionCountry: NetherlandsProject: EC | DASISH (283646)
The aim of this task was to analyse and compare the different metadata strategies of CLARIN, DARIAH and CESSDA, and to identify possibilities of cross-fertilization to take profit from each other solutions where possible. To have a better understanding in which stages of the research lifecycle metadata comes to the fore, we looked at several research data lifecycles and business process models. However the current research data lifecycle models have the ‘static’ data object as basis, whereas metadata design , redesign, creation and management can continue to be ‘live’ issues within the research lifecycle. We therefore developed a metadata lifecycle based closely on familiar lifecycle models but extended to support the more dynamic metadata issues. To describe the metadata management of the different infrastructures we took a double approach. We looked on a more general level and outlined the policies and strategies regarding metadata of the three infrastructures. We evaluated these strategies on metadata qua lity issues with the Bruce and Hillmann criteria. On the other hand we looked with more detail how the work on metadata management is done by the individual data repositories. The infrastructures of CESSDA, CLARIN and DARIAH differ in visions, strategies and initiatives regarding metadata issues; similarly there is a difference in metadata management among the various repositories. Despite these differences, cross fertilisation by coordination on common lists of metadata elements, sharing of knowledge, and linking resources would leverage the overall metadata quality. Evaluation of the prototype of the joint CLARIN, DARIAH and CESSDA metadata portal endorses the opinion that more coordination is needed. Metadata quality must be discussed in relation to the activities for which they are used. We suggest that the infrastructures DARIAH and CLARIN prioritise future collaboration about standardisation efforts, which have already been in itialised in dialogue between the CLARIN Standards Committee and the DARIAH representatives. Similar initiatives could be established with CESSDA.
- Other research product . 2016Open Access EnglishAuthors:Otegi, Arantxa; Aranberri, Nora; Branco, António; Hajic, Jan; Neale, Steven; Osenova, Petya; Pereira, Rita; Popel, Martin; Silva, João; Simov, Kiril; +1 moreOtegi, Arantxa; Aranberri, Nora; Branco, António; Hajic, Jan; Neale, Steven; Osenova, Petya; Pereira, Rita; Popel, Martin; Silva, João; Simov, Kiril; Agirre, Eneko;
handle: 10451/33107
Publisher: European Language Resources AssociationCountry: PortugalProject: EC | QTLEAP (610516)This work presents parallel corpora automatically annotated with several NLP tools, including lemma and part of-speech tagging, named-entity recognition and classification, named-entity disambiguation, word-sense disambiguation, and coreference. The corpora comprise both the well-known Europarl corpus and a domain-specific question-answer troubleshooting corpus on the IT domain. English is common in all parallel corpora, with translations in five languages, namely, Basque, Bulgarian, Czech, Portuguese and Spanish. We describe the annotated corpora and the tools used for annotation, as well as annotation statistics for each language. These new resources are freely available and will help research on semantic processing for machine translation and cross-lingual transfer.
add Add to ORCIDPlease grant OpenAIRE to access and update your ORCID works.This Research product is the result of merged Research products in OpenAIRE.
You have already added works in your ORCID record related to the merged Research product.