Preskoči na vsebino

No one needs dictionaries anymore. More people than ever before 
use dictionaries.

Not so much printed ones, but dictionary data are now built into mobile phones, software, systems and are in use all the time.

New and updated dictionaries are badly needed fast.

ELEXIS is not concerned with dictionary building itself, but it opens the existing resources, digitize legacy data, develop new language tools and services and make them accessible in an open network.

Elexis
ELEXIS facilitates fast building of high-quality dictionaries and databases.

Create a dictionary

You can create it manually from scratch using Lexonomy or automatically collect data with OneClick Dictionary.

Convert your dictionary

If you already have a dictionary, you can convert it with Elexifier.

Link senses

NAISC helps you to automatically link senses between two different datasets.

Edit your dictionary

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nam gravida mi a mi elementum.

Enrich your dictionary

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nam gravida mi a mi elementum.

Publish your dictionary

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nam gravida mi a mi elementum.

Elexis

Tools and services

Lexonomy

Lexonomy is a cloud-based dictionary-writing and also online-dictionary-publishing system which is highly scalable to adapt to large dictionary projects as well as small lexicographic works such as editing and online publishing of domain-specific glossaries or terminology resources. Lexonomy already interacts with Sketch Engine and the aim of the project is to develop and expand this interaction further. Sketch Engine can push lexicographic data into Lexonomy to create automatically generated dictionary drafts and Lexonomy can pull data from Sketch Engine’s corpora during the entry editing process.

elexifier

Elexifier is a cloud-based dictionary conversion service. It uses advanced XML parsing and machine learning techniques to help you convert your PDF and XML dictionaries in a standardized machine-readable format. Users can upload their PDF and custom XML dictionaries to Elexifier, define mapping rules for XML transformation or create a machine learning training set for PDF conversion and download the transformed XML or PDF dictionary in a TEI-compliant file format based on the Elexis Data Model.

elexiLink

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Nam gravida mi a mi elementum, nec finibus lorem rutrum. Nulla sodales odio vitae risus malesuada, at fermentum enim vehicula.

News Feed

Lexicographic news feed is an ELEXIS service that uses the Event Registry API to extract latest news articles identified to be related to lexicography. News articles are extracted from 30,000 news sources, and over 35 languages are currently supported.

ElexiFinder

The search tool ELEXIFINDER is dedicated to helping lexicographers and other researchers find scientific output in lexicography and related fields. It enables users to search through papers and videos, using concepts, i.e. words or set of words with a Wikipedia page, and various other conditions, e.g. source (conference etc.), author, language etc. Each paper/video is linked to its page where the users can download or view it.

NAISC

NAISC 1.0 is a tool for linking datasets and was created by the SFI Insight Centre for Data Analytics and the ELEXIS project. NAISC serves as a system for aligning RDF datasets: It takes as input 2 RDF documents (referred to as ‘left’ and ‘right) and outputs an alignment (set of RDF triples) between these two documents. NAISC typically relies on a configuration, which is a JSON document.

OneClick Dictionary

OneClick Dictionary (OCD) is a dictionary drafting module. It interconnects a corpus management system (e.g. SketchEngine, noSketch Engine) or even excel sheets with our dictionary writing and online dictionary publishing system Lexonomy and provides an automatically created dictionary draft (e.g. headwords, wordforms, collocations, examples), to be post-edited in Lexonomy by the lexicographer. OneClick Dictionary enables lexicographers to shift all lexicographers work and intellectual input into the post-editing phase instead of manually analyzing the input data before creating a dictionary draft. Hence, the tool is not limited to professionals but also designed for spontaneous lexicography – small projects of lexicographic nature such as glossaries and domain-specific wordlists and dictionaries often prepared by teachers or other professionals without formal training in lexicography. The source code for the OneClick Dictionary module is available on Github, additional information is available in the Deliverable 4.2:

Clusty

Clusty is an innovative algorithm designed to perform lexical-semantic analytics for NLP: sense clustering. The team at the Linguistic Computing Laboratory of the Sapienza University of Rome investigated clustering approaches which allow to effectively and easily scale across languages whilst dropping the requirement of large amounts of data which is typically needed when employing neural networks. Clusty’s results can be used for improving word sense disambiguation systems. The demonstration of the efficacy of Clusty for performing one of the most challenging tasks in natural language processing, sense clustering, is presented in D3.1 (below). For installation we provide the link to GitHub repository.

MultiMirror

MultiMirror is a cross-lingual sense projection approach for multilingual WSD based on a novel discriminative word alignment model, capable of jointly aligning all source and target tokens with each other, surpassing its competitors across several language combinations. The sense-tagged datasets it produces lead a standard WSD classifier to achieve state-of-the-art performances on established benchmarks in French, German, Italian, Spanish and Japanese. MultiMirror was developed by the Sapienza Natural Language Processing Group (Sapienza NLP) and the ELEXIS project.

BabelNetLinker

The BabelNet Linker is a linking web service which produces a mapping between two dictionary definitions in a cross-lingual scenario. The BabelNet-linker API allows a dictionary to be linked to BabelNet at definition level. Specifically, this API allows a definition in any language to be mapped to a semantically-equivalent English definition in BabelNet by relying on state-of-the-art Transformer-based architectures. Importantly, this API will make it possible to map the dictionaries made available within the ELEXIS Consortium at definition level by pivoting through BabelNet. BabelNet Linker was developed by the Sapienza Natural Language Processing Group (Sapienza NLP) and the ELEXIS project.

VerbAtlas

VerbAtlas is a novel large-scale manually-crafted semantic resource for wide-coverage, intelligible & scalable Semantic Role Labeling. The goal of VerbAtlas is to manually cluster WordNet synsets that share similar semantics into sets of semantically-coherent frames.

Cross The Word

CrossTheWord is a crossword puzzle game for Android with small and big crossword puzzles, available for free download via the GooglePlay Store.

Elexis

About ELEXIS project

The ELEXIS project benefits from the expertise of some of the top experts in the fields of lexicography, linguistics and natural language processing, who agreed to share their experience and contribute their efforts to the success of the project.

EU

This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 731015.