Impressions

Brought to you by:

broadcasted by

videolectures

Sponsored by:

active_logo

aws

franz

iqser_small

logo

neon

ontotext

sti_ibk_logo
value it

Supported by:

avco_logo

celtic

isoco

scheffield

Sponsor ESTC2009

Place your Logo here!

Accepted Papers PDF Print E-mail

CASE STUDIES I (Semantics in the Enterprise)

1. Business Semantics-driven Integration of Service-oriented Applications | Pieter De Leenheer, Collibra nv/sa

Business semantics define the contextual meaning of key business assets for your organization in terms of business facts and rules. Business Semantics have a dual utility: the derived business semantics do not only provide a shared glossary to augment human understanding, but can also be used to automate otherwise costly and error-phrone data integration during complex business process integration.
Many organisations start to realise the potential of business semantics to leverage information management, and take initiatives at the grass roots. However sustainable and meaningful business semantics management must be organised and cultivated organisation-wide by the right balance of people, methods, and tools.


2. Semantic Business Portal - A use case from Nanotechnology | Christian Ehrlich, Ontos AG

Portals are not new but are typically based on aggregation of structured information. Highly innovative industries like the one for Nanotechnology need to integrate information from various sources with a large portion being from unstructured text. This session will demonstrate how semantic technologies, especially text mining tools, allow to "semantize" content and to interlink pieces together for better decision making. Different stake holders need different information and the presented "Nanotrack" portal links patents, scientific papers, news, blogs and other sources together. In the session the user will learn how implicit connections are identified, how possible trends are recognized and how the system supports a research team in the identification process of skills of persons and companies. The underlying technology is a framework for crawling, ontology driven information extraction and a scalable RDF store.


3. Developing a software factory business plan | Alfonso de la Fuente Ruiz, CPIICYL

We, the people, intelligently manage information in a variety of ways. The birthright of ubiquitous computing, of the Internet and of mobile devices (such as smartphones, handhelds and PDAs) renders the marketing of new products and services possible to be embedded within the knowledge economy.
Best utility is attained when human-computer interaction is fluid, which is only possible when advanced, flexible and accessible software interfaces are developed.
Centralization and homogenization of semantic databases, along with the thriving of communication in their capillarity are two key points that act by catalyzing a technological revolution, in the way of accessing the compendium of human knowledge, storing its representations and treating those, thus allowing for exponential increases in the productivity and efficiency of organizations.


4. Flexible Knowledge Generation and Re-Use in Aerospace Engineering using the XMediaBox | Aba-Sah Dadzie, The University of Sheffield

We employ a scenario describing the issue resolution process in the aerospace engineering domain to determine requirements for enhanced knowledge management and intuitive analysis. We discuss the use of Semantic Web technologies in an interactive knowledge framework to support the stages of the knowledge management lifecycle: knowledge generation, manipulation and use, enrichment, sharing and re-use. The framework makes use of a modular domain ontology that defines users and their tasks, and the physical objects users interact with during their normal working activities, to guide the hierarchical, context-driven hypothetical analysis performed during issue resolution. The ontology also guides the enrichment of the knowledge generated as a by-product of users' interaction with the knowledge framework. An important requirement was tool design that improves users' analytical ability and knowledge capture and use, without increasing users' cognitive load or disrupting their normal way of working.
Our approach moves traditional, document-centric analysis to the more abstract but effective, conceptual focus on the knowledge contained within data.

5. MARLIES - WebIE Supporting the Supply of Manufacturing | A.Univ.-Prof. Dr. Birgit Proell, Johannes Kepler University Linz

Sourcing and procurement are mayor tasks of supply chain management, which increasingly benefit from Web technologies in that companies make information on their services retrievable online for the prospective consumers. Web Information extraction technologies constitute the prerequisite to enable the semi-automatic support for finding the best fitting service supplier on the Web. In our presentation we introduce MARLIES, which is targeted on extracting contact data and data on services, e.g., machines and manufacturing processes offered by companies via their heterogeneously designed Web sites. MARLIES implements an ontology-aware knowledge engineering approach based on the text engineering framework GATE. Major challenges are (1) the ontological modelling of the complexly related technical data, (2) the extraction of relations between entities addressing structural aspects, e.g., related entities concealed in nested tables, and (3) ranking of results using various relevance criteria.



CASE STUDIES II (Search)


1. KnowleSuite: Bringing deep search to end-users with Semantic Technologies | K. Rajaraman and Ang Wee Tiong, Institute for Infocomm Research

Digital information available as un/semi-structured data is estimated to comprise of 53% of all available information in enterprises, and little of it makes way into data warehouses. In addition, state-of-the-art search technologies for accessing  information from online sources produce highly efficient query issue but are in-effective  in delivering targeted results. They provide little support to articulate complex queries which results in poor delivery and indexing of harvested information. We present a generic service-oriented semantic technology infrastructure, called Knowle platform, that uses a combination of text mining, ontologies, and reasoning techniques, and enables visual query navigation and data mining of un/semi-structured, and structured data indexed with OWL-DL ontologies. Knowle has been applied to a variety of domains including Healthcae & Life sciences, Business Intelligence, Patent Landscaping and Campaign management.

2. Demo: Extending Traditional Enterprise Search using Semantic Techniques | Steve Fullerton and Ian Pallen, Solcara Limited

Building on the power and flexibility of a federated search tool that enables an asynchronous search approach to internal and external information sources, this demonstration shows how a semantic federating layer, using the OpenRDF Sesame SAIL API, integrates traditional and semantic data sources.
Input/output data properties identifying each source’s data classes are semantically mapped. Searches are executed at query time only on those sources relevant to query the class of data required. Source parameter relationships may generate further federated, correlated sub-queries to iterate over intermediate results to create a final result set. The system caters for authentication, reaching content not indexed by global indexers.
Repositories enhanced with this layer can process SPARQL queries referencing local RDF triples and patterns triggering federated searches to create an integrated result set. This opens up new opportunities for enterprises building semantic knowledge bases while retaining traditional data stores and accessing external information.

3. The IRF and making search research more relevant to practice | Sabine Ringhofer, IRF

The Information Retrieval Facility (IRF) is an international not-for-profit foundation based in Vienna. The IRF is dedicated to the promotion and facilitation of research on large-scale information retrieval and is controlled by a Scientific Board consisting of 13 distinguished scientists in information retrieval and related disciplines, chaired by K. van Rijsbergen. The IRF is a member of the LarKC project consortium, which is building an infrastructure for very large scale inference with a semantic web environment. Another major focus for the IRF is the evaluation of the effectiveness of information access and search systems especially for patents. The IRF initiated and took a leading role in the NIST TREC CHEM 2009 and the CLEF IP 2009 evaluation.The IRF is a membership organization open to both academic and industrial members. A high performance computing facility is operated by the IRF in Vienna, and made available to its members for research purposes. Standard IR packages like Lemur/Indri and Terrier are available. Large datasets are also made available, including MAREC, a corpus of 19 million patent documents and the ClueWeb09 collection of a thousand million web pages.


BUSINESS TOOLS AND APPLICATIONS I (Business Intelligence)

1. Sig.ma: to see the Web of data, to foresee its future | Giovanni Tummarello, DERI

We present Sig.ma, both a service and an end user application to access the Web of Data as an integrated information space.
Sig.ma uses an holistic approach in which large scale semantic web indexing, logic reasoning, data aggregation heuristics, ad hoc ontology consolidation, external services and responsive user interaction all play together to create rich entity descrip- tions. These consolidated entity descriptions then form the base for embeddable data mashups, machine oriented services as well as data browsing services. Finally, we discuss Sig.ma's peculiar characteristics and report on lessions learned and ideas it inspires.

2. WEB 2.0 and Semantic Technologies for Open Innovation | Guillermo Alvaro, iSOCO

In recent years, Web 2.0 has achieved a key role in the Internet community and applications based on collective intelligence have grown in importance in the enterprise context. Companies are adopting this paradigm for their internal processes in the so-called Enterprise 2.0. The integration of Enterprise 2.0 with semantic technology seems to be essential for its successful adoption with a positive return of investment. On the other hand, the introduction of the Open Innovation model, for which the innovation process should be opened out of the R&D department to all the employees and external actors, requires a technological infrastructure to be supported. In this paper we discuss how the Web 2.0 philosophy and Semantic Technology support Open Innovation and how three big European companies have profited from this new paradigm.



BUSINESS TOOLS AND APPLICATIONS II (Knowledge Management)

1. Practical applications of Semantic MediaWiki in commercial environments - Case Study: semantic-based project management | Daniel Hansch, ontoprise GmbH

Semantic MediaWiki, the semantic extension of MediaWiki, promises a big potential for a particular class of commercial applications which share common characteristics. Project management is an example of this class of applications where Semantic MediaWiki appears to be well suited to fuse text and data and – as a side effect – to foster collaborative work in a project team.
This presentation shows how semantic MediaWiki can be successfully employed for incremental knowledge formalization in the context of complex projects.
We demonstrate how the project management and the working domain can be amalgamated such that project data is easily captured from working documents.
Futher we discuss the suitability of semantic MediaWiki for project management and future challenges and directions for semantic-based project management.


2. The NEPOMUK Semantic Desktop - Solutions, Tools, and Providers | Leo Sauermann, gnowsis

How can knowledge workers benefit from the semantic web?
Today’s operating systems as such do not provide a semantic layer.  The Social Semantic Desktop is a radical application of Semantic Web technologies on the desktop. Personalized semantic services allow applications to categorize and link information based on the individual work processes and needs of the users.
Existing data is converted to RDF, enabling new annotations and data analysis across applications. From the user's perspective documents can be annotated with related contacts, projects, and topics, which helps finding information.
From 2006 to 2008, the EU funded NEPOMUK project developed these services, ontologies, and applications. Now, this technology is currently deployed to millions of KDE 4.0 users. Nokia has adapted it for the mobile Maemo platform. The NEPOMUK spin-off company gnowsis.com offers support for enterprise and groupware applications.


3. SKOS Thesaurus Management based on Linked Data | Andreas Koller, punkt. netServices GmbH

The process of building and maintaining thesauri, especially in corporate settings, quite often cannot be justified due to its unfavourable cost-benefit ratio. Combining different approaches to build thesauri like text-mining, machine learning, expert driven thesaurus modelling and linking open data seems to be a promising methodology to overcome this obstacle. This paper gives an overview over a thesaurus building methodology based on Linked
Data which was implemented in PoolParty which is a Thesaurus Management Tool (TMT) for the Semantic Web.

 



BUSINESS TOOLS AND APPLICATIONS III (Spatio-temporal Data, Web Services)

1. Moving Objects: Look in the Past, Manage the Present and Predict the Future | Jans Aasman, Franz Inc.

We see an increased interest from telecom providers, the transportation industry and defense integrators in tracking hundreds of thousands to millions of moving objects in real time. Consider fleets of trucks, swarms of airplanes, track data for soldiers on the battlefield, or track data for endangered animals.
An RDF (Semantic Web) Database is a thin layer on top of a Graph Database.  Data stored in a linked graph is very different from data stored in a table, and the storage has different strengths.  In this presentation we'll show AllegroGraph’s RDF database features for highly-scalable storage and search of spatial and spatio-temporal (moving-object) data.
We discuss the efficiency and cost of various kinds of data analysis. We informally propose five kinds of queries with increasing execution cost.
Application examples will include representation of river drainages, and detecting patterns of ship traffic in San Francisco Bay.


2. Geo-Semantic Time series for Tourism | Robert Barta, rho information systems

Future semantically-enhanced tourist information systems will be very dynamic. They will focus more on individual tourists, their preferences and their current locations. To interpret this data properly these holistic, situation- aware systems will need to respect more external, such as weather or traffic conditions.
All dynamic data can be organized into time sequences, i.e. sequences of values, each value not only carrying a core message, but also some circumstantial information (when, how, where).
Our proposed approach is not only to represent these time sequences within a semantic network store (in our case AllegroGraph). More importantly we demonstrate how to use an appropriate functional language, Formula 3, to compute new time sequences based on existing ones. These transformations can be time aggregations, but they can also include the underlying semantic network.
Newly derived values then mark the emergence of events which can be reported back into the application code.


3. Web Service Discovery on Large Scale | Nathalie Steinmetz, seekda GmbH

Searching for Web Services on the Web today is a hard task. General-purpose search engines are not precise enough and dedicated service portals are too limited in scale. Service-Finder helps in solving this problem. It employs Information Retrieval and Extraction techniques (like crawling, automatic annotation and indexing) as well as Semantic Technologies, to turn the unstructured and distributed information about Web Services into structured data that expresses service semantics with respect to service-domain ontologies. The result is a Web portal (http://demo.service-finder.eu) offering search functionalities to discover services. In comparison with existing solutions, it achieves both a higher recall, because it indexes more than 25,000 services, and a higher precision, thanks to the semantic annotations that enrich the service descriptions. This represents a big advantage for end users, who can easily discover unknown services that fit their need.

4. Practical Applications of Semantic Technologies in the Enterprise | Mike Cataldo, CEO Cambridge Semantics

For the enterprise, semantic technology offers a paradigm shift that will impact businesses on the same scale as the introduction of the personal computer or the advent of the internet. Semantic technology can put power in the hands of every day users that was previously the exclusive domain of IT, creating dramatic improvements in efficiency and responsiveness that can change the game for many businesses. In order to realize this potential, semantic technology must be delivered in a lightweight package that allows for rapid realization of value that can scale throughout the enterprise. Further, the ability to build solutions must be well within the reach of non-technical users without the assistance of IT if desired. This session will show actual examples of how semantic technology has been deployed in just such a way, going from initial installation to enterprise-wide production deployment in a matter of days for the largest buyer of advertizing media in the world. In the context of this solution, you will see how spreadsheets are easily linked to a semantic data collaboration
server and how data from those spreadsheets can be manipulated
and browsed in a Web visualization tool. 


USING TECHNOLOGIES I (Methodologies and Approaches)

1. Bottom-up Ontology Design with Semantic Rules | Chris Moran, Tellilink Corporation
Designing a system that uses semantic rules requires not only a new way of thinking about the business logic of the system but also the semantics of the system. Semantic rules are usually thought of as an extension of the model—as a way to enable rules for classification to be expressed in an ontology. In practice, this approach creates overly complex and fragile models, and overly simplistic logic.
This paper introduces an approach to building systems with semantic rules that separates system logic from the system model and system data. It enables models to be simple and general; and it enables logic to be applied to a concept in the context of a system or business process. This separation of the model and the logic means that very general models can be tailored for very specific use. To think of it in human terms: a person’s response to “danger” might always be the same—fight or flee—we are very general systems in a sense. But the concept of Danger, and the logic of classifying something as Dangerous, changes constantly depending on the situation. What distinguishes us as a species is that concepts like Danger are not hard-wired into us. The logic that lets us classify a situation as Dangerous is separate from the concept itself and how we respond to it.

2. Semantic Approach to Privacy Protection | Miroslav Blako, Czech Technical University in Prague
P3P is a W3C recommendation for expressing service privacy practices in a structured computerreadable form. However, the lack of formal semantics makes it inconvenient for fully computerized matching against privacy preferences (e.g. the APPEL language). SemPref is an interesting approach that defines several possible semantics for P3P using relational database technology. However, it still doesn't have capabilities to detect errors that frequently occur when designing both privacy preferences and service privacy practices. This paper analyses the problem of detecting semantic errors in P3P policies in depth and proposes a solution based on semantic technologies, namely the OWL Web Ontology Language and the SPARQL-DL query language.

3. Design of a Cost Efficient Methodology to Compare Search Methods | Robert Tolksdorf, Freie Universität Berlin
Since the advent of the Internet, search engines play a prominent role for the search and discovery of resources. They developed from pure full text indexing mechanisms over indexing systems accounting for the structure of the web, to systems, which produce answers instead of plain resource lists. However, the methods for comparing their result quality remained nearly unchanged over the years and are still based on the information retrieval measures: precision and recall. Even though these measures are well understood, they lack certain important properties when applied to internet search engines. In order to overcome their weaknesses, we present a new, cost efficient approach for comparing search methods based on their results. We used this new method to compare quantitatively the semantic search technology developed by our company with the full text search of a major topic-specific internet portal.

4. Semantic Mashups for the Masses | Thorsten Liebig, Derivo GmbH
This talk introduces Jigs4OWL, a semantic mashup framework which offers a web-enabled semantic infrastructure while hiding all non-relevant technical details. It allows to easily build web browser applications which aggregate, visualize, and mashup semantic content.
The framework consists of a server which hosts semantic data uploaded by end users.
The framework also delivers a collection of graphical widget templates for dealing with this content. A semantic application is build by customizing the templates and combining it with further widgets (such as Google Maps) or other data sources. This allows to develop consumer and business mashups with a few lines of code and without the burden of operating a complex application back-end.
The framework can be either driven by a company as an internal or external service or employed on-demand from a framework service provider. A public accessible instance of the latter is online available for evaluation at www.jigs4owl.com.
In this talk we introduce the mashup framework and discuss the wide range of application scenarios. We will also exemplarily show how to adapt and configure it at one‘s own needs.

USING TECHNOLOGIES II (Linked Open Data)

1. Reasoning with a Billion of Linked Data Facts | Atanas Kiryakov, Ontotext AD

LDSR is a collection of linked data datasets, selected and refined to present a reason-able view to the center of the LOD cloud. LDSR consists of 915 million explicit statements and includes DBPedia, Freebase, Geonames, Wordnet, CIA Factbook, lingvoj, UMBEL, and the ontologies, used by them (e.g. SKOS, FOAF, and DC).
We loaded LDSR in the OWLIM semantic repository, inferring 655 million statements that have been indexed along the explicit ones (OWL dialect similar to OWL 2 RL was used). Although OWLIM performs complete forward-chaining, it does not materialize all the results – while the total number of all indexed statements is 1.57 billion, the number of retrievable statements is 3.02 billion.
LDSR is available for search, exploration, and query evaluation at http://www.ontotext.com/ldsr/. LDSR and its front-ends develop constantly in order to present the best possible way for someone to comprehend and start using the web of linked data.

2. Linked data for Personal Information Management | Girts Niedra, TenForce
It proves to be very difficult to manage the personal and public information on the web, resulting from the magnitude of portals and users requesting the same up-to-date information. In this paper the author is proposing a community - driven project, called LinkedPIM, which uses semantic technologies and open subscription protocols to make delivery of up-to-date Personal Information Management artefacts easy and reliable.

3. SKOS Extensions for the EUROVOC Thesaurus | Johan De Smedt, TenForce
The EUROVOC Thesaurus is now available in RDF.  For this purpose, EUROVOC has been modeled using OWL.  The resulting EUROVOC ontology is a SKOS and SKOS-XL extension.  The paper and the presentation details how thesaurus specific artifacts of EUROVOC are represented complying to SKOS, to SKOS-XL and to the EUROVOC specific ontology.
Attention is given to the following topics: (i) thesaurus organization in domain and micro-thesaurus; (ii) thesaurus release identification; (iii) management of the vocabulary, different equivalence relationships and permuted labels; (iv) sub-types of concepts; (v) usage of 'notes' and notes formatting (using XHTML + RDFa); (vi) documenting rules applied for generating SKOS and SKOS-XL compliant representations.
 
Copyright ©2008 STI International, All rights reserved.