software . 2012 . Embargo end date: 13 Sep 2018

Multiword Extractor

Rubino, Francesco; Quochi, Valeria; Frontini, Francesca
Open Access
  • Published: 12 Dec 2012
  • Publisher: Istituto di Linguistica Computazionale “A. Zampolli” - Consiglio Nazionale delle Ricerche (ILC-CNR)
Abstract
This is a lexical acquisition web service for the automatic extraction of multiword expressions (MWEs) from large corpora. The service takes as input a POS-tagged corpus in CoNLL-X format plus a pair of POS tags for the first and last word of an MWE, and outputs a list of candidate multiword expressions, each with a set of linguistic and statistical information. The output can then be post-processed through filters that refine the extraction and improve its accuracy, and finally converted to an LMF-compliant XML lexical resource. The tool's source code is openly available at https://github.com/francescafrontini/MWExtractor. Further details can be found in: Quo...
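The input/output contract described in the abstract can be illustrated with a minimal sketch. This is not the MWExtractor code or its API: the function names (`read_conllx`, `extract_candidates`), the `max_len` parameter, the use of the coarse POS column, and the raw-frequency count are all assumptions made for illustration. It only shows the general idea of selecting token sequences whose first and last words carry a given pair of POS tags.

```python
from collections import Counter


def read_conllx(text):
    """Yield sentences as lists of (form, pos) pairs from CoNLL-X text.

    CoNLL-X rows are tab-separated: ID, FORM, LEMMA, CPOSTAG, POSTAG, ...
    Sentences are separated by blank lines.
    """
    sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            if sentence:
                yield sentence
                sentence = []
            continue
        cols = line.split("\t")
        # Assumption: match on the coarse tag (CPOSTAG, 4th column).
        sentence.append((cols[1], cols[3]))
    if sentence:
        yield sentence


def extract_candidates(text, first_tag, last_tag, max_len=3):
    """Count token sequences (2..max_len tokens) that start with
    first_tag and end with last_tag. Hypothetical helper."""
    counts = Counter()
    for sent in read_conllx(text):
        for i, (_, pos) in enumerate(sent):
            if pos != first_tag:
                continue
            for j in range(i + 1, min(i + max_len, len(sent))):
                if sent[j][1] == last_tag:
                    candidate = " ".join(form.lower() for form, _ in sent[i:j + 1])
                    counts[candidate] += 1
    return counts


def _row(idx, form, lemma, cpos):
    """Build one 10-column CoNLL-X row with unused fields set to '_'."""
    return "\t".join([str(idx), form, lemma, cpos, cpos] + ["_"] * 5)


# Tiny invented corpus: two sentences, for illustration only.
SAMPLE = "\n".join([
    _row(1, "The", "the", "DET"),
    _row(2, "hot", "hot", "ADJ"),
    _row(3, "dog", "dog", "NOUN"),
    _row(4, "was", "be", "VERB"),
    _row(5, "good", "good", "ADJ"),
    "",
    _row(1, "A", "a", "DET"),
    _row(2, "hot", "hot", "ADJ"),
    _row(3, "dog", "dog", "NOUN"),
])

if __name__ == "__main__":
    for candidate, freq in extract_candidates(SAMPLE, "ADJ", "NOUN", max_len=2).items():
        print(candidate, freq)
```

A real extractor would typically replace the raw count with association measures and apply the post-processing filters mentioned above; those steps are left out here because the abstract does not specify them.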
Funded by
EC | CELLCOM-GBS
Project
CELLCOM-GBS
Control of Streptococcus agalactiae virulence genes via peptide-based cell to cell communication
  • Funder: European Commission (EC)
  • Project Code: 327146
  • Funding stream: FP7 | SP3 | PEOPLE
Communities
Digital Humanities and Cultural Heritage