31st Oct 2008
Semantic kernels can mean better (broader) search: usage #1
I’ve posted a usage example of how sematic kernels can be applied to real life. For those of you, who (like me) was wondering “how the heck Semantic Web is going to be better” - here’s a small part of my view of it.
The example is “traditional” search of Twitter for Linux vs “semantic kernel search” for same phrase. It uses a small random part of kernel - it is done just because I didn’t want to load Twitter servers with hundreds of queries.
The demo is here: http://semkernel.com/tweet/linux
The idea is that if you search for all the words that are closely related to the topic - you’re going to find broader results. It’s unnatural for people to write obvoius (for them) things, like if I say: “Visited St.Basil’s Cathedral today - the weather was nasty” - it would be unnatural for me to write “Visited St.Basil’s Cathedral in Moscow, Russia today” because it’s redundant information.
Another example would be “I’ve seen Obama today” vs “I’ve seen Obama today. Obama is a democrat. Democrats are political party. Political party is politics.” - you see how dumb should be phrase so that “traditional” keyword search find the “I’ve seen Obama today” on “politics” query.
Okay, if you enter politics into Google - you would probably find Obama’s site, but… that takes a lot of people linking it with “politics” phrase. And that leaves very little choice over what you’ll find. Actually mostly no choice - you wouldn’t find phrase “I’ve seen Obama today” in any way in search engine, because nobody would be linking to it. That leaves us with mainstream news sources. That’s exactly why blogging is dying today - single bloggers can’t compete with big corporate bloggers, who can buy media love.
Ok, back to the “St.Basil’s weather” phrase. So, if you were searching for “Russia” or “Moscow, Russia” - you would NOT find my post, because there’s “implied” knowledge, which is hidden inside of a phrase. Traditional search engines can’t see it.
But if the next-gen search engine would’ve used “semantic kernels” to expand the result - you’d have found it, because keyword “basil’s cathedral” is in semantic kernel of “Moscow” (although not yet in the semantic kernel of “Russia”). But this technology is still young and developing and actually those semantic kernels are only partial of what can be achieved.
Semantic kernels can mean that search engines would find your page if it has quality information without gazillions of links. Actually linking has nothing to do at all with relevancy search. Well, it SHOULDN’T have anything to do…
There wouldn’t be “googlebombing” if linking wasn’t a major part of Google’s algo. And the way they “solved” the problem now means that you have to write “democrats are political party, which is politics” so that you can be found on “politics” query.
And the search is only one field where semantics can be a major step forward.
That one of the usages my semkernel.com is going to try to help people with. The technology is there to try.. now.
I’ve posted a usage example of how sematic kernels can be applied to real life. For those of you, who (like me) was wondering “how the heck Semantic Web is going to be better” - here’s a small part of my view of it.
The example is “traditional” search of Twitter for Linux vs “semantic kernel search” for same phrase. It uses a small random part of kernel - it is done just because I didn’t want to load Twitter servers with hundreds of queries.
The demo is here: http://semkernel.com/tweet/linux
The idea is that if you search for all the words that are closely related to the topic - you’re going to find broader results. It’s unnatural for people to write obvoius (for them) things, like if I say: “Visited St.Basil’s Cathedral today - the weather was nasty” - it would be unnatural for me to write “Visited St.Basil’s Cathedral in Moscow, Russia today” because it’s redundant information.
Another example would be “I’ve seen Obama today” vs “I’ve seen Obama today. Obama is a democrat. Democrats are political party. Political party is politics.” - you see how dumb should be phrase so that “traditional” keyword search find the “I’ve seen Obama today” on “politics” query.
Okay, if you enter politics into Google - you would probably find Obama’s site, but… that takes a lot of people linking it with “politics” phrase. And that leaves very little choice over what you’ll find. Actually mostly no choice - you wouldn’t find phrase “I’ve seen Obama today” in any way in search engine, because nobody would be linking to it. That leaves us with mainstream news sources. That’s exactly why blogging is dying today - single bloggers can’t compete with big corporate bloggers, who can buy media love.
Ok, back to the “St.Basil’s weather” phrase. So, if you were searching for “Russia” or “Moscow, Russia” - you would NOT find my post, because there’s “implied” knowledge, which is hidden inside of a phrase. Traditional search engines can’t see it.
But if the next-gen search engine would’ve used “semantic kernels” to expand the result - you’d have found it, because keyword “basil’s cathedral” is in semantic kernel of “Moscow” (although not yet in the semantic kernel of “Russia”). But this technology is still young and developing and actually those semantic kernels are only partial of what can be achieved.
Semantic kernels can mean that search engines would find your page if it has quality information without gazillions of links. Actually linking has nothing to do at all with relevancy search. Well, it SHOULDN’T have anything to do…
There wouldn’t be “googlebombing” if linking wasn’t a major part of Google’s algo. And the way they “solved” the problem now means that you have to write “democrats are political party, which is politics” so that you can be found on “politics” query.
And the search is only one field where semantics can be a major step forward.
That one of the usages my semkernel.com is going to try to help people with. The technology is there to try.. now.
Posted in ideas | Comments Off