LIMES

LIMES 0.3 Screenshot

LInk discovery framework for MEtric Spaces

LIMES is a link discovery framework for the Web of Data. It implements time-efficient approaches for large-scale link discovery based on the characteristics of metric spaces. It is easily configurable via a web interface. It can also be downloaded as standalone tool for carrying out link discovery locally. In addition the Colanut GUI implements mechanisms for the automatic suggestion of link configurations.

Try LIMES 0.5RC1

Try GUI

Download LIMES



Latest news

LOD2 Webinar Series: LIMES ? Discovery of Links across Knowledge Bases
The 1st version of the LOD2 Stack has been published in September 2011 in the form of an LOD2 Stack demo and the downloadable LOD2 Stack virtual machine image – additional details and the instructions on installing the LOD2 Stack from scratch are available in the How-To-Start document. Born from the wish to make linking [...]

AKSW tools prominently featured in TÜB?TAK
Börtecin Ege wrote an article on the Semantic Web in the December issue of TÜB?TAK (The Scientific and Technological Research Council of Turkey) prominently featuring DBpedia, Relfinder, LIMES, SPARQL Benchmark and other AKSW related research projects. See Semantik Web Tübitak Bilim Teknik 12 2011.

Finally ? Assisted Link Discovery
Hello world, We are happy to announce that LIMES has been extended with an interface that will make linking easier than ever before. The COLANUT (Complex Linking in a NUTshell) interface implements time-efficient schema matching algorithms that allow LIMES to discover and suggest initial class and properties matchings for linking. The whole is embedded in [...]

General Overview

LIMES implements novel time-efficient approaches for link discovery in metric spaces. Our approaches utilize the mathematical characteristics of metric spaces to compute estimates of the similarity between instances. These estimates are then used to filter out a large amount of those instance pairs that do not suffice the mapping conditions. By these means, LIMES can reduce the number of comparisons needed during the mapping process by several orders of magnitude.


The general workflow implemented by the LIMES framework comprises four steps: Given a source, a target and a threshold, LIMES first computes a set exemplars for the target data source (step 1). This process is concluded by matching each target instance to the exemplar closest to it. In step 2 and 3, the matching is carried out. In the filterig step, the distance between all source instances and target instances is approximated via the exemplars computed previously (step 3). Obvious non-matches are then filtered out. Subsequently, the real distance between the remaining source and target instances are computed (step 3). Finally, the matching instances are are serialized, i.e., written in a user-defined output stream according to a user-specified format, e.g. NTriples (step 4).

Architecture

The LIMES framework consists of seven main modules of which each can be extended to accommodate new or improved functionality. The central modules of LIMES are the controller module, which coordinates the matching process and the data module, which contains all the classes necessary to store data.
The matching process is carried out as follows: First, the controller calls the I / O-module, which reads the configuration file and extracts all the information necessary to carry out the comparison of instances, including the URL of the SPARQL-endpoints of the knowledge bases, the restrictions on the instances to map (e.g., their type), the expression of the metric to be used and the threshold to be used. Examples of configuration files can be found in the distribution.


Given that the configuration file is valid w.r.t. the LIMES Specification Language (LSL), the query module is called. This module uses the configuration for the target and source knowledge bases to retrieve instances and properties from the SPARQL-endpoints of the source and target knowledge bases that adhere to the restrictions specified in the configuration file. The query module writes its output into a cache, which can be a file (for large number of instances, not implemented yet) or main memory. Once all instances have been stored in the cache, the controller calls the organizer module. This module carries out two tasks: first, it computes the exemplars of the source knowledge base. Then, it uses the exemplars to compute the matchings from the source to the target knowledge base. Finally, the I / O-module is called to serialize the results.

Running LIMES

Running LIMES can be done in two ways. You can use our hosted Linking Service or download the LIMES package and run it locally on your server (faster).

Publications

Further information

  • A manual for configuring and using LIMES can be found PDF Documenthere.
  • A paper containing a detailed description of LIMES and its mathematical foundations can be found PDF Documenthere.

Contact

Dr. Axel-C. Ngonga Ngomo
Johannisgasse 26, Zimmer 5-22
04103 Leipzig

Tel.: +49 341 97-32341
E-Mail, Workpage

Timofey Ermilov
Johannisgasse 26, Zimmer 5-20
04103 Leipzig

Tel.: +49 341 97 32322
E-Mail, Research Group, Workpage

Dr. Sören Auer
Johannisgasse 26, Zimmer 5-09
04103 Leipzig

Tel.: +49 341 97-32367
E-Mail, Homepage, Research Group, RDF Documentrdfs:seeAlso


 
There are 8 files on this page. [Display files/form]
There is no comment on this page. [Display comments/form]

Information

Last Modification: 2012-04-20 10:41:49 by Axel Ngonga