LOD2 Webinar Series: LIMES ? Discovery of Links across Knowledge Bases
The 1st version of the LOD2 Stack has been published in September 2011 in the form of an LOD2 Stack demo and the downloadable LOD2 Stack virtual machine image – additional details and the instructions on installing the LOD2 Stack from scratch are available in the How-To-Start document. Born from the wish to make linking [...]
AKSW tools prominently featured in TÜB?TAK
Börtecin Ege wrote an article on the Semantic Web in the December issue of TÜB?TAK (The Scientific and Technological Research Council of Turkey) prominently featuring DBpedia, Relfinder, LIMES, SPARQL Benchmark and other AKSW related research projects. See Semantik Web Tübitak Bilim Teknik 12 2011.
Finally ? Assisted Link Discovery
Hello world, We are happy to announce that LIMES has been extended with an interface that will make linking easier than ever before. The COLANUT (Complex Linking in a NUTshell) interface implements time-efficient schema matching algorithms that allow LIMES to discover and suggest initial class and properties matchings for linking. The whole is embedded in [...]
General Overview
LIMES implements novel time-efficient approaches for link discovery in metric spaces. Our approaches utilize the mathematical characteristics of metric spaces to compute estimates of the similarity between instances. These estimates are then used to filter out a large amount of those instance pairs that do not suffice the mapping conditions. By these means, LIMES can reduce the number of comparisons needed during the mapping process by several orders of magnitude.

The general workflow implemented by the LIMES framework comprises four steps: Given a source, a target and a threshold, LIMES first computes a set exemplars for the target data source (step 1). This process is concluded by matching each target instance to the exemplar closest to it. In step 2 and 3, the matching is carried out. In the filterig step, the distance between all source instances and target instances is approximated via the exemplars computed previously (step 3). Obvious non-matches are then filtered out. Subsequently, the real distance between the remaining source and target instances are computed (step 3). Finally, the matching instances are are serialized, i.e., written in a user-defined output stream according to a user-specified format, e.g.
NTriples (step 4).
Architecture
The LIMES framework consists of seven main modules of which each can be extended to accommodate new or improved functionality. The central modules of LIMES are the controller module, which coordinates the matching process and the data module, which contains all the classes necessary to store data.
The matching process is carried out as follows: First, the controller calls the I / O-module, which reads the configuration file and extracts all the information necessary to carry out the comparison of instances, including the URL of the SPARQL-endpoints of the knowledge bases, the restrictions on the instances to map (e.g., their type), the expression of the metric to be used and the threshold to be used. Examples of configuration files can be found in the distribution.

Given that the configuration file is valid w.r.t. the LIMES Specification Language (LSL), the query module is called. This module uses the configuration for the target and source knowledge bases to retrieve instances and properties from the SPARQL-endpoints of the source and target knowledge bases that adhere to the restrictions specified in the configuration file. The query module writes its output into a cache, which can be a file (for large number of instances, not implemented yet) or main memory. Once all instances have been stored in the cache, the controller calls the organizer module. This module carries out two tasks: first, it computes the exemplars of the source knowledge base. Then, it uses the exemplars to compute the matchings from the source to the target knowledge base. Finally, the I / O-module is called to serialize the results.
Running LIMES
Running LIMES can be done in two ways. You can use our hosted
Linking Service or
download the LIMES package and run it locally on your server (faster).
Publications
- Axel-Cyrille Ngonga Ngomo und Klaus Lyko:
EAGLE: Efficient Active Learning of Link Specifications using Genetic Programming. In: Proceedings of ESWC 2012
- Mohamed Morsey, Jens Lehmann, Sören Auer, Axel Cyrille Ngonga Ngomo:
DBpedia SPARQL Benchmark – Performance Assessment with Real Queries on Real Data. Proceedings of ISWC2011, 2011. Best Paper Award.
- Axel-Cyrille Ngonga Ngomo and Sören Auer:
LIMES – A Time-Efficient Approach for Large-Scale Link Discovery on the Web of Data. Proceedings of IJCAI 2011.
- Axel-Cyrille Ngonga Ngomo:
A Time-Efficient Hybrid Approach to Link Discovery. ISWC'11 Workshop on Ontology Matching (OM-2011).
- Axel-Cyrille Ngonga Ngomo, Jens Lehmann, Sören Auer and Konraf Höffner:
RAVEN – Active Learning of Link Specifications. ISWC'11 Workshop on Ontology Matching (OM-2011).
Further information
- A manual for configuring and using LIMES can be found
here.
- A paper containing a detailed description of LIMES and its mathematical foundations can be found
here.
Contact
| Dr. Axel-C. Ngonga Ngomo Johannisgasse 26, Zimmer 5-22 04103 Leipzig
| |||||||
| Timofey Ermilov Johannisgasse 26, Zimmer 5-20 04103 Leipzig
| |||||||
| Dr. Sören Auer Johannisgasse 26, Zimmer 5-09 04103 Leipzig
| |||||||
Information
Last Modification:
2012-04-20 10:41:49 by Axel Ngonga