Ontology Part 4: Digging a bit deeper
by Jens on Nov.03, 2010, under General, Online Data Sources
Ok, so I’m now writing the 4th post centring about ontology. In reality, what I am interested in is actually how to use ontology to improve the sharing and location of data, and if at all possibly from a viewpoint of not wrecking the entire existing data models, because it’s time consuming, expensive, and quite often – let’s be realistic - just not an option.
In a slideshare post by Juan Esteva, there are some interesting issues addressed about data integration:
View more presentations from juanesteva.
To sum up my main take. There are 3 main challenges to successfull data integration – essentially independent on if you are talking about a internet based global concept, or if you are talking about disparate data sources within your own organisation (admitted there is a scalability difference!)
- Syntactic Challenges – e.g. different models and languages
- Schematic Challenges – e.g. structural differences
- Semantic differences – e.g. different meanings and understandings.
To achieve true interoperability, you should in theory address all 3, and it would be nice to be able to do so for most people. But some of these challenges will remain, no matter what conceptual nirvana you present.
Each of the challenges have their own issues. There will always be lingual differences – but they are gradually being overcome.
There will always be schematic differences. Simply because nobody will ever use the same vendor and the same solution – and they probably shouldn’t either.
But in terms of semantics, there is at least an opportunity to present the information in a structured fashion now - just like it has been possible to define your language, and your schema, you can now at least make representation of your understanding of what it is that you share. It will probably not be perfect (In fact it is almost guaranteed that someone will argue – and like with the single vendor/schema solution- perhaps that’s a good thing!) – but it does provide the opportunity to include the current level of understanding for which data is being shared.
To start approaching more “global” ontologies, there’s a next step involved i suspect, whereby classifications and entities’ relationships are defined by all the possibly used combinations, without passing judgement on the use. Instead perhaps it should merely be the strength of the usage of particular triplicates that determine your most likely representation and understanding of a concept. So we’re not barring anyone from semantically expressing that if it walks like a duck – its-a three-legged pony. But because there is an overwhelmingly more popular usage of the semantic triplicate its-a duck, we can regard it as more likely without dismissing equestrian counterparts.
I am actually working on something slightly more constructive in terms of ontology use, which will show up in a future part of this series, but it is the more philosophical aspects of the concept that still both excites and bugs me.
1 Trackback or Pingback for this entry
November 4th, 2010 on 4:10 pm
[...] This post was mentioned on Twitter by Quentin H. Reul, Jens Rasmussen. Jens Rasmussen said: Ontology Part 4: Digging a bit deeper: Ok, so I’m now writing the 4th post centring about ontology. In reality, wh… http://bit.ly/cQwLDZ [...]