jump to navigation

A simple demonstration of the power of semantic web using SPARQL January 24, 2013

Posted by Anand Mallaya in technology.
Tags: , , , , ,
add a comment

I want to find the actors of the movies in which two popular male actors of South India acted together.
The task is to find movies in which Mammootty and Mohanlal worked together and other actors in each of the movies.

Try googling it.

Now click on the link here
The link points to a SPARQL query to do the same! And look at the results on the bottom of the page!
bingo!
Welcome to Semantic Web!
Now graph search is invented by Facebook, or is it?

For more info Check out : dbpedia.org

Semantic Web "Layercake" (2006)

Semantic Web “Layercake” (2006) (Photo credit: Wikipedia)

Enhanced by Zemanta
Related Posts:

Videos on Semantic Web and Linked data July 25, 2010

Posted by Anand Mallaya in internet, technology, web.
Tags: , , , , , ,
add a comment

TED talks – Tim Berners Lee On the Next Web

TED -Talks – Tim Berners-Lee: The year open data went worldwide

TED Talks – Hans Rosling on HIV: New facts and stunning data visuals – A use-case of Linkeddata

Tim Berners lee talking about at Gov 2.0 Expo 2010

The Semantic Web of Data Tim Berners-Lee

practical semantic web – creating a catalog of Linked data April 4, 2010

Posted by Anand Mallaya in computers, internet, technology, tutorial, web.
Tags: , , , , , , ,
6 comments

Today I am going to work on creating a semantic web document. I am going to make a catalog of linked data datasets listed under linkeddata.org. Here is the list I am going to publish in RDF.

  1. choose the correct vocabulary – there are different generic vocabularies like Dublin Core, FOAF etc and specialized vocabulary like Dcat, Void etc. for creating catalogs. Dcat is designed for government data catalogs, so I choose VoID vocabulary, which is designed for single dataset provider. It uses generic vocabularies lke FOAF and DC as well.
  2. Select suitable tools – Tools to edit the RDF document. There are plenty of them ,in this case and RDF editor, like the Rhodonite tool for RDF editing and browsing. But I couldn’t understand it well because of poor documentation and help. So I chose an online VoID editor from DERI Galway. Though the result is in Turtle format, there are tools to convert turtle document to RDF/XML format. Like this one online : RDF Validator/converter rdfabout.com
  3. Creating the semantic grpah – first I am going to choose a dataset and add it to my catalog. To start with, I chose CrunchBase entry listed in linkeddata.org.  Go to VoID editor and add the following details in it
  4. Dataset URI:  http://cb.semsol.org/

    Dataset Homepage URIhttp://cb.semsol.org/

    Dataset Name:  Crunchbase

    Dataset Description:  RDFized Crunchbase entries

    Example Resource:  http://cb.semsol.org/company/yahoo

    Dataset Topic : business, database

    Vocabulary URIhttp://www.w3.org/1999/02/22-rdf-syntax-ns# , http://www.w3.org/2003/01/geo/wgs84_pos#,  http://cb.semsol.org/ns#

    Publisherhttp://semsol.com

    SPARQL endpoint : http://cb.semsol.org/sparql

    Now the dataset entry for cruchbase dataset is ready in VoID vocabulary, in the right side textarea. It is in Turtle notation.

    @prefix dcterms: <http://purl.org/dc/terms/&gt; .
    @prefix void: <http://rdfs.org/ns/void#&gt; .
    @prefix : <#> .
    ## your dataset
    <http://cb.semsol.org/&gt; rdf:type void:Dataset ;
    foaf:homepage <http://cb.semsol.org/&gt; ;
    dcterms:title “Crunchbase” ;
    dcterms:description “RDFized Crunchbase entries” ;
    dcterms:publisher <http://semsol.com&gt; ;
    void:sparqlEndpoint <http://cb.semsol.org/sparql&gt; ;
    void:vocabulary <http://cb.semsol.org/ns#&gt; ;
    void:exampleResource <http://cb.semsol.org/company/yahoo&gt; ;

    @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt; .@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#&gt; .@prefix foaf: <http://xmlns.com/foaf/0.1/&gt; .@prefix dcterms: <http://purl.org/dc/terms/&gt; .@prefix void: <http://rdfs.org/ns/void#&gt; .@prefix : <#> .
    ## your dataset<http://cb.semsol.org/&gt; rdf:type void:Dataset ; foaf:homepage <http://cb.semsol.org/&gt; ; dcterms:title “Crunchbase” ; dcterms:description “RDFized Crunchbase entries” ; dcterms:publisher <http://semsol.com&gt; ; void:sparqlEndpoint <http://cb.semsol.org/sparql&gt; ; void:vocabulary <http://www.w3.org/1999/02/22-rdf-syntax-ns#&gt; ; void:vocabulary <http://www.w3.org/2003/01/geo/wgs84_pos#&gt; ; void:vocabulary <http://cb.semsol.org/ns#&gt; ; void:exampleResource <http://cb.semsol.org/company/yahoo&gt; ; dcterms:subject <http://dbpedia.org/resource/Database&gt; ; dcterms:subject <http://dbpedia.org/resource/Business&gt; .

  5. Convert in to RDF/XML file(serialize) – copy the dataset details in turtle notation and go to RDF/XML converter tool. And paste the turtle notated content there and select input format as N-Triples/Turtle, and click validate. The result is given below.
  6. <?xml version=”1.0″?>
    <rdf:RDF xmlns:foaf=”http://xmlns.com/foaf/0.1/&#8221; xmlns:void=”http://rdfs.org/ns/void#&#8221;
    xmlns:rdf=”http://www.w3.org/1999/02/22-rdf-syntax-ns#&#8221;
    xmlns:dcterms=”http://purl.org/dc/terms/”&gt;
    <void:Dataset rdf:about=”http://cb.semsol.org/”&gt;
    <foaf:homepage rdf:resource=”http://cb.semsol.org/&#8221; />
    <dcterms:title>Crunchbase</dcterms:title>
    <dcterms:description>RDFized Crunchbase entries</dcterms:description>
    <dcterms:publisher rdf:resource=”http://semsol.com&#8221; />
    <void:sparqlEndpoint rdf:resource=”http://cb.semsol.org/sparql&#8221; />
    <void:vocabulary rdf:resource=”http://www.w3.org/1999/02/22-rdf-syntax-ns#&#8221; />
    <void:vocabulary rdf:resource=”http://www.w3.org/2003/01/geo/wgs84_pos#&#8221; />
    <void:vocabulary rdf:resource=”http://cb.semsol.org/ns#&#8221; />
    <void:exampleResource rdf:resource=”http://cb.semsol.org/company/yahoo&#8221; />
    <dcterms:subject rdf:resource=”http://dbpedia.org/resource/Database&#8221; />
    <dcterms:subject rdf:resource=”http://dbpedia.org/resource/Business&#8221; />
    </void:Dataset>
    </rdf:RDF>

  7. Now repeat the above process and create entries for all the datasets listed in the Linkeddata.org.
  8. Combine the RDF entries in to a single file, no need to copy all the tags but starting from <void:Dataset .. >only
    something like
    <?xml version=”1.0″?>
    <rdf:RDF xmlns:foaf=”http://xmlns.com/foaf/0.1/&#8221; xmlns:void=”http://rdfs.org/ns/void#&#8221;
    xmlns:rdf=”http://www.w3.org/1999/02/22-rdf-syntax-ns#&#8221;
    xmlns:dcterms=”http://purl.org/dc/terms/”&gt;
    <void:Dataset rdf:about=”http://cb.semsol.org/”&gt;


    </void:Dataset>
    <void:Dataset rdf:about=”http://dbpedia.org/”&gt;
    ….
    ….
    <void:Dataset>
    <void:Dataset rdf:about=”http://www.geonames.org/”&gt;


    <void:Dataset>
    </rdf:RDF>
  9. And the catalog is ready in RDF/XML with all the datasets added. Save it as a file with extension .rdf like Linked_data_catalog.rdf.  Now the machine can understand (if programmed wisely) to some extend what data are available there to convert in to meaningful information and then to  knowledge.
    You can find the catalog here: Linkeddata datasets catalog(note: not yet ready, comeback later )

10 exciting Science/Technology events of 2009 December 31, 2009

Posted by Anand Mallaya in technology.
Tags: , , , ,
1 comment so far
This is a list of  10 exciting events of 2009 related to Science/Technology.

1.Google Chrome OS announced (Tech)

Google announced its entry in to the Operating System world with its Chrome OS. A straight display of their ambition to challenge Microsoft and Apple. Google’s entry to cloud computing.

2.Large Hadron Collider revived (Science/Tech)

The biggest experiment in the history of human civilization revived after initial set backs. Conducting experiments setting record on fastest paricle acceleration

3. Launch of WolframAlpha (Tech)

An unconventional but still mysterious concept called computational knowledge engine dubbed WolframAlpha is a promise of technology and a new way to deal with the web of data.

4. Robot does science experiment(Tech)

A break through in Artificial Intelligence. A robot, called Adam, successfully conducted scientific experiment and discovered scientific knowledge for the first time. Another promise of technology. BBC news article

5. Indus valley script decoded(Science/Tech)

Using pattern recognition technologies, researchers were able to identify language like patterns in one of the earliest writing system of human civilization. Wired.com article

6. Chandrayan Mission (Science/Tech)

India successfully launched its prestigious indigenously developed moon mission.

7. Water found on Moon(Science/Tech)

Convincing evidence of presence of water on moon discovered by equipments mounted on the Chandtrayaan mission and again by LCROSS mission by NASA.

8. Evidence of Dark matter(Science)

Dark matter detected in scientific experiments. Gurdian news

9. Nuclear battery(Science/Tech)

Tiny nuclear batteries were demonstrated by scientists. BBC news article

10. Ice found on Mars(Science)

The Phoenix Mars Lander of NASA came up with evidence for water ice on Mars polar region. Space.com report

This list is my personal compilation. There may be more significant events that are not included. Please feel free to share your events.

140 character Twitter mystery – the answer October 12, 2009

Posted by Anand Mallaya in internet, social media, technology, web.
Tags: , , , , , , , ,
add a comment

In the previous post we have looked at anatomy of a tweet to find answers to the 140 character mystery. It quite nagged me for a while. My geek mind was losing to find an explanation. Today I found a solution. It has nothing to do with the twitter protocols probably, but some thing simpler. Twitter was created with a goal of a simple interface for near-real time communication. With a ubiquitous access to the service.

They gave all kind of user interfaces, like a web based one for web browser users,  APIs for web application developers- so that developers can create applications easily, and for mobile phones through text messages.

Guess what? Now you may have got a clue.

The text message or SMS was a protocol initially designed to optimally use the telephony based GSM mobile technology. The key idea for SMS was to use this telephony-optimized system and to transport messages on the signaling paths needed to control the telephony traffic during time periods when no signaling traffic existed. The message length was limited by the signalling protocol used at that time. Initially it was 128 bytes and later changed to 140 bytes length.

Now Twitters’ designers may have also targetted text message users, the most widely used data application on the planet.

SMS user stats

SMS user stats

Hence my answer.

To find the real answer we need to contact Twitter engineers..

Phew I will get a good sleep today :p

Anatomy of a tweet – 140 character mystery August 15, 2009

Posted by Anand Mallaya in computers, internet, social media, technology.
Tags: , , , , , , ,
2 comments

Everybody tweets now-a-days. Even cats tweet(@sockongton). A tweet is a burst of 140 characters from all those interesting ‘characters’ around the globe, describing ‘what they are are doing’. I wonder why just 140 characters. why not 130 or 150. So I started digging and the search lead me to check the technical anatomy of a tweet. And it gave some interesting facts about twitter. I will put the tweet anatomy examination report.

  • Number of charactr in a tweet is 140
  • Total content length(XHR request) when a tweet is 140 exactly(maximum length) – 262
  • Minimum content length(XHR request) when a tweet is one char long – 107
  • Meaning XHR header content length is 106 byte- a constant

So I checked for an answer with twitter API response

We can check a users’ status using User/show APIhttp://twitter.com/users/show.xml?screen_name=anandcv

The response is an xml file( given below is the response of my user id @anandcv when my last tweet was 140 char long)

Twitter user/show API respone

Twitter user/show API respone

Ther response got an interesting field in the status – <truncated>false</truncated>.

Probably a remainder of early implementation, which allowed more than 140 characters. Currently twitter web interface allows only 140 chars.

The other fields are

<created_at></created_at> – the time stamp
<id></id>  – the unique message id
<text></text> – the tweet itself
<source></source> – source of the tweet (from web or other tools etc..)
<in_reply_to_status_id/> – Replying to some tweet?
<in_reply_to_user_id/> –  Replying to somebody user id?
<favorited>false</favorited> – favorited?
<in_reply_to_screen_name/> – screen name of the user.

No clues yet. The search continues…

Reblog this post [with Zemanta]

Zii StemCell computing revisited- Plazma and Android power August 7, 2009

Posted by Anand Mallaya in computers, computing, multimedia, technology.
Tags: , , , , , , , , , , ,
add a comment

Some time back we have seen the mystery of the Zii StemCell computing being unravelled.  After that many were wondering what Creative is up to with the chips. Now they have come back with Zii EGG Handheld StemCell Computer that Supports the Plaszma OS or Android™ OS . Good move !.  Android hmmm!

The platform boasts multi-gesture touch display(cool!) , Text-to-speech Feature, Hardware GPS Receiver, Wifi, 32GB of internal memory and up to 32GB of external memory, Bluetooth, USB 2.0 and accelrometer.
They are heard of iPhone.  No 3G? shame! Check dictionary for competence. Get USB 3.0 dudes.

Plazma OS, based on the linux 2.6 kernel is said to show some gimmicks like 3D grphics (‘ in a battery powered small handheld’),  advanced media and communication capabilities.  But they say it is not a phone but a computer! in 108 gms.  Hmmm! I wonder.

Also support a Zii optimized Android OS. Supposedly to harness the power of the StemCell computing technology of the ZMS-05 processor.

Added value – bulk prize of 199$ per piece price tag plus  lot of development support and  market ready design.

Gonna rock or bust?! Its all about the all-knowing crowd and the economy. 😉

Reblog this post [with Zemanta]

Twitter – the dawn of the real-time web July 5, 2009

Posted by Anand Mallaya in internet, technology, web.
Tags: , , , , ,
add a comment

Twitter.com is omnipresent in the web.  Since its inception in 2006, in a short time span Twitter has grown to such a magnitude that Google got scared. The reason – collaboration and real-time. The incredible combination resulted in phenomenal popularity. The service is very simple and useful and  became omnipresent in the web.

image source:  Google trends

If  google discovered the power of data, Twitter discovered the real-time. In this fast world real-time means a lot. And data is powered by the evergrowing crowds. The API(Application Programming Interface) supported by twitter offered the data open to the technical community. And that proved just right thing to do. The data was used in a very creative ways.

From stock trading (Stocktwits) and real estate(TweetLister) to diet and exercise(Twackit), twitter found a long list of applications.

And twitter is attracting a large number of famous people from celebrities, writers and millionaires to politicians and leaders, all finds time to tweet.

Let us wait and see what wonders Twitter will bring in forecoming days.

Numbers of wonder part 1 February 15, 2009

Posted by Anand Mallaya in mathematics, technology.
Tags: , , , , , ,
1 comment so far

Wonder what numbers has to do in a tech blog? I will tell you.
In a series of posts we will explore a group of numbers that are fascinating and mysterious. Let us start with prime numbers.

Prime Numbers

A prime number is a number which has only two factors, 1 and the number itself. In other words it is divisible only by 1 and itself.
2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37, 41, 43, 47, 53, 59, 61, 67, 71, 73, 79, 83, 89, 97….
Prime numbers are enigma not only for the school children but the top mathematicians and even the best computing machines of our time. I was fascinated by the beauty of the numbers many times. They are the building blocks or atoms of all other numbers. Any other number can be represented as a product of two or more prime numbers. The fact that they are still mysterious even in the time of super computers of teraflops of computing power, makes them more enigmatic and appealing for the world’s top brains.

Though the distribution may seem random, they forms special patterns when arranged in certain manners(image below).

(image courtesy http://www.franken.de)

If it is not fascinating then what is?. It looks as mysterious as theNazca Lines .

Nature has a special place for numbers in its creativity. You will see numbers like Golden ratio, Fibonasci numbers, pi, e, and Phi. Suddenly it comes to my mind that Prime numbers too have significance that we have not found yet(or may be I). Though the numbers are easy to understand, their identification and computation are done using complex algorithms that takes lot of computing power, especially when the number of digits are high. Thus some of those algorithms are used to benchmark computer chips.
In my point of view,  there are lot more to be revealed about the prime numbers. The largest prime number revealed to us can be seen as a benchmark for the level of technological advancement of human civilization itself.
Prime numbers are important in computing as well. Several public-key cryptography algorithms, such as RSA, are based on prime numbers .
They are so important that there are a number of distributed computing projects initiated in order to find large prime numbers that require large amount of computing power.
Some of the projects are:

Here are some links for more information
Prime numbers on Wikipedia
Patterns in Primes
Prime Numbers at MathWorld
Article from The Seed magazine
Some Results of Computational Research in Prime Numbers
End of Numbers of wonder part 1
more numbers to come

Add to Technorati Favorites

Reblog this post [with Zemanta]

The Zii stemcell computing mystery finally solved January 8, 2009

Posted by Anand Mallaya in computers, computing, multimedia, technology.
Tags: , , , , , , , , ,
1 comment so far

After the specultions and hype campaigns Creative Technology Ltd. today revealed the mysterious- the so called “Stemcell computing” platform. It turns out to be an architecture for a new kind of multimedia SOCs(System-on-chip). There are two processors the ZMZ05 and DMS05. The name stemcell is used to represent the flexibility of the architecture to progammability. Stem cells are the embryonic cells which turnes in to all other kind of cells in a organism later. The device is based on programmable arrays called Processing Elements. These are said to be flexible and completely programmable and can be used to develop many type of applications with minimal additional hardware like stem cells.

– Flexibile Media-Rich SoC

– Software Defined Silicon acceleration

– Complete Platform Solutions

– Ready-for-Market Platform Solutions and Development Kits

– Advance Middleware

– A rich-suite of development tools and middleware

– Incredible Scalability – from 10 Gigaflops to Peta FLOPS

– Energy Efficient – High density processing architecture for reduced power consumption

For detailed specifications refer the link: ZMS05

After many speculations about the Stemcell computing, the hype is settled now. According to Creative:

“With an investment that spans over 10 years, over a billion dollars and over 10,000 man-years, the world is poised to experience a breakthrough that will better lives beyond our imagination.” 

The multimedia processor may be targetting from handheld devices to highend gaming consoles and boshes teraflops of scalable performance. Something that can be comaprable to supercomputers. We can expect some great stuff like the Nintendo Wii or Apple iphone like gadgets or like Sony playstation like consoles. Array of Playstation 3 processors already claimed of attaining supercomputing performance(story). Some application areas mentioned in the Zii website are

– PMP, PND and MID

– NetBook – Video Conferencing

– Video Surveillance

– Smartphone and PDA

– IP Media

– Medica

l – Embedded Systems

Let us wait and see what wonders the Zii will do to the world of computing.