Powerset

I’ve been playing around with Powerset, a new Semantic Search Engine, which uses natural language search technology that is based on patents licensed exclusively from Palo Alto Research Center (formerly Xerox) and its own proprietary indexing.

Instead of being limited to keywords, Powerset allows you to enter keywords, phrases, or questions. Instead of just showing you a list of blue links, Powerset gives you more accurate search results, often answering questions directly, and aggregates information from across multiple articles.

I have to confess I am very impressed with it – I used it for a few hours late last night to help find some references for a blog piece I’m writing and loved the way it seemed to understand the concepts I was asking about rather than giving me matches based on keywords. Currently it only searches Wikipedia but does also provide results from Freebase as well. I really find the MiniViewer useful, it allows you to view a short result snippet presented on the results screen within the other original context it was extracted from, this also makes the search results feel interactive.

Here’s the demonstration video that explains how Powerset is different using some cool examples:


Powerset Demo Video from officialpowerset on Vimeo.

I’m really interesting in seeing where Powerset goes next …

A Dream Within A Dream

       A Dream Within a Dream

Take this kiss upon the brow!
And, in parting from you now,
Thus much let me avow --
You are not wrong, who deem
That my days have been a dream;
Yet if hope has flown away
In a night, or in a day,
In a vision, or in none,
Is it therefore the less gone?
All that we see or seem
Is but a dream within a dream.

I stand amid the roar
Of a surf-tormented shore,
And I hold within my hand
Grains of the golden sand --
How few! yet how they creep
Through my fingers to the deep,
While I weep -- while I weep!
O God! can I not grasp
Them with a tighter clasp?
O God! can I not save
One from the pitiless wave?
Is all that we see or seem
But a dream within a dream?

        --by Edgar Allan Poe

Processing.js

Processing is a Open Source data visualization programming language. I first played around with it about a year ago. I was recently reminded of it by Rob, and have started playing with it again. However, I just discovered that earlier in the week John Resig released his JavaScript Port, Processing.js. So far it looks amazing, virtually all the demo/example applications that are shipped with Processing are running using the CanvasElement in JavaScript. I’m going to have a lot of fun with this.

John deserves a huge amount of credit for this contribution.

Revelation

    1913 A Boy's Will
       Revelation

WE make ourselves a place apart
Behind light words that tease and flout,
But oh, the agitated heart
Till someone find us really out.
'Tis pity if the case require
(Or so we say) that in the end
We speak the literal to inspire
The understanding of a friend.
But so with all, from babes that play
At hide-and-seek to God afar,
So all who hide too well away
Must speak and tell us where they are.

          -- by Robert Frost

Seeing sights around Beijing

After the WWW2008 conference ended me and guys stayed on in Beijing for a couple of days in order to take in some of the local sites. Armed with our camera’s and a subway map, Rob, Paul and Myself managed to find our way to Beijing Zoo and saw an assortment of wonderful animals – but for me it was the Pandas, that I had really wanted to see.

We also a section of The Great Wall in Ba Da Ling. The sheer immensity of this 6000 km structure is breathtaking. You can’t help but feel small and insignificant as you walk along the wall and and take in the view around you:

We also visited The Temple of Heaven, which is situated near the center of Beijing. There was something wonderfully serene about the temple, that made you feel as though you’d stepped from the hustle and bustle of one world into the magical calm of another. It’s a site rich with tradition and as a temple it is beautiful to behold.

We also walked through Tiananmen Square. In some ways it felt surreal, as we walked through it taking pictures, I couldn’t help but recall as a teenager seeing images on television of students demonstrating and vividly remember seeing images of tanks bearing down on demonstrators. Yet as I looked around there were thousands of people, mostly chinese, milling around the square, there were also many tourists everyone seemed cheerful, it was a far cry from those images on TV:

We tried to visit The forbidden city, but unfortunately by the time we arrived there were hundreds of people queuing to buy tickets to get in and because it would have taken at least an hour to queue there we opted not to given we only had a few hours left. So we decided to walk around the grounds outside The Forbidden city and take some pictures:

Following that we finally visited The Summer Palace, which for me was the highlight of the trip. It is one of the most beautiful places I have ever been. We took a boat out on the lake and took some pictures of the palace:

I loved the visit to China. I wish I could think of a bad thing to say but I really can’t – everyone we met was polite, approachable and kind; the sites we saw were amazing; the food was great!. I must confess there was a huge difference between the China I experienced and the China that is reported in the media here in the UK – which is the real China? I honestly can’t say, but I will cherish my memories of the place. As usual all my photo’s are on flickr, but I should warn you that I have uploaded roughly six hundred of them (out of the 1500 I took!). To make things easier if you click on any of the photos above they’ll show you pictures tagged with that specific location. Or if your prefer you can view the whole lot here.

You can also see Rob’s pictures here and you can see Chris’ pictures here.

WWW2008 – Day Four

The day got off to a very slow start I wasn’t feeling very well so I missed the keynote. I made my way to the Dev Track and got there in time for the first session. The opening talk was by Giovanni Tummarello on Semantic Web Pipes which seemed like a great idea. Semantic Web Pipes are analagous to the Yahoo! Pipes framework but differ in that the Yahoo framework uses RSS and the item lists in RSS do not lend themselves to the graph based modeling that RDF has. Giovanni explained that with Semantic Web Pipes you can fetch, mix and process RDF files published on the Web because the output of a Pipe is an HTTP retrievable RDF model, simple pipes can also work as inputs to more complex Pipes.

There were a couple of other memorable talks one by Rajat Agarwalla on Scrabulous, in which he explained how and why they developed the game., and what they experienced having to scale up very rapidly to satisfy the phenomenal demand they had. The one thing Rajat said that resonated deeply with me was (im paraphrasing here because I can’t fully remember it) “if you build something you love you’ll make money eventually, but if you build something just for money then you won’t“. This sentiment echoes what Confucius once said:

"Choose a job that you love and you will never
     have to work a day in your life"
              - Confucious

The other talk that both Rob and I wanted to listen to was by the Sebastian Kruk on MarcOnt Mediation Services. He told us that they had completely rewritten the MarcOnt portal and it should now be available, but it wasn’t clear to me that they were planning to do anything with Marc21 or that they had engaged with any of the other people working in this area, which was disappointing.


There were other talks during the day, after which we all got carted off to the evening Banquet at the Great Hall of the People in Tiananmen Square. Our colleague Jingye had told us how rare it was for anyone to be invited to dine there and that it was a very special venue – and he was right! My only regret was that I didn’t take my camera or phone with me. We had been told earlier in the day that camera’s weren’t allowed and that phones with camera’s might not be allowed either, so we decided to leave them in our hotel rooms. When we got there it was apparent many others had brought theirs with them and no-one was stopping them taking pictures – so I was kicking myself all evening.

Sir Tim Berners-Lee gave his keynote before the banquet and shared with us his hopes for the WWW, many of which seemed to be centered around the Semantic Web and the Linked Data movement, both of which we at Talis are heavily involved in. Tim talked about the need to get better at communicating our ideas and to engage with each other more., about breaking down barriers and get better at communicating with each other across both the physical and social/political borders and barriers that exist. Paul has a done great writeup of the keynote over on ZDNet, so I won’t dwell on it too much – suffice to say that I thought it was great keynote, and quite inspirational in some ways.

After the keynote the banquet and the evenings entertainment got under way. The food was great as were the performances, particularly one by the Peking Opera. We all had a great time. It was a very memorable evening.

WWW2008 – Day Three

The full WWW2008 Conference got under way yesterday with a Opening Ceremony that involved music and dancing and fancy lights. There were some introductions from key dignatories, before the Plenary Keynote by Google’s Dr Kai-Fu Lee. He talked about Cloud Computing and explained how it liberates the user from having to remember where data is, how it enables the user to access information from anywhere, and how cloud computing makes services faster and more powerful by leveraging what could be an infinite amount of information and massively scalable infrastructure. He explained at a high level how Google uses thousands of small cheap servers rather than fewer large expensive servers in order to create a massive network of commodity servers that form its cloud, he explained how replication works and how every piece of information is replicated to at least three different servers in this cloud. we also learned a little about how Google’s Translate service works and saw some examples of how and why its better than many of the alternatives out there.

After the keynote I went to to the Panel discussion on The Future of Online Social Interactions: What to Expect in 2020. The discussion was about how Online social interactions will evolve in the next decade – I didn’t get a lot out of this discussion it was hard take bits of it seriously, I found the thought that one might review someones rating, before allowing them to come over and sleep on one’s couch just a little far fetched … or am I just too cynical?


After lunch I attended the W3C Track, in which Tom would be giving his presentation on Linking Open Data. Tom presented this with Chris Bizer and they both did a great job conveying both the importance of opening up data and then linking data. Following Tom’s talk Raphael Troncy gave a short presentation on Semantic Multimedia, most of which I had already seen earlier in the week during his tutorial session on the same subject, nevertheless it was a good presentation. The final talk in the session was given by Huajun Chen on Semantic Web Development in China. This was a really useful talk that gave us an insight and a sense of how much Semantic Web research is going on in China – which was encouraging to hear and also very impressive.

Following the afternoon break I went to the Panel Discussion on WWW in China. This proved to be very useful, not only did we get a sense of the kind of WWW research going on in China we get a very real sense of how much academics from the various universities represented by the panelists want to engage with their colleagues outside of China to further their research. Some of the specific research discussed included Spam Filtering, Search Clustering, Modelling and Mining the emergent web etc. After the talk I got a chance to talk briefly with one of the panelists, Professor Xiaoming Li (pictured), from Peking University and suggested to him that the data he and his colleagues where capturing and mining from the web might be ideally suited to being modelled in RDF and Linked to other sources which he seemed to think was an interesting idea. I offered our help which he seemed to welcome 🙂

Since this was the final session of the day we headed back to the hotel. Rob suggested that he wanted to skip going to the evening Reception in favour of exploring Beijing on foot and taking some pictures. Chris and Paul decided that they wanted to go to the reception and meet up with Tom ( and make sure he came out with us for a bite to eat 😉 ) so I decided to go on walkabout with Rob. Turned out to be a great decision – I haven’t had so much fun in ages.

I don’t know how far we walked but we were walking for a couple of hours, we explored the area around the new National Stadium, and whilst it is still a building site no one said anything to us in fact everyone was really polite and very welcoming. I have traveled a great deal in my life and it’s very rare that I’m out and about anywhere and don’t feel that I need to be alert in fact I’m usually always aware of my safety. Yet, yesterday, as we wandered around with our Camera’s in our hands, snapping pictures of anything and everything at dusk and into the night, I never once felt unsafe or felt the slightest bit of anxiety – I loved that. I haven’t had so much fun in very long time. I also learned a lot about my buddy Rob, he was behaving like a school kid at times, climbing up onto pallets of bricks, or onto a crane, to take pictures, willing to walk to anyone and say hello, At one point late into the evening we stopped near the stadium and Rob was trying to ask a bunch of young police officers if they could give him permission to get onto something higher to take a picture. Obviously they didn’t understand english and he doesn’t speak a word of Chinese and yet we were all laughing as they tried to translate what he was saying.
It was good to see him laughing so much …sometimes I think Rob thinks too much about things, we are both very similar in that respect, we can become hugely introspective … but I know we all need to let our hair down from time to time and just have fun … and we did! As some of the pictures show.

Later in the evening Paul, Chris, Tom, Denny and Gloria met up with the two of us and we all headed to a restaurant that Rob and I had passed on our walk that we thought looked nice, it turned out to be a wonderful place and we all had a wonderful meal, before heading back to the hotel. Rob and I did walk back over to the Stadium to take some more pictures and this is when he got permission to climb up onto one of the cranes on the building site to take a few pictures.

All in all I had a great time at the conference yesterday and one of the most memorable evenings I have had in a very long time exploring a strange new city with a great friend.

I have just uploaded just under 200 more pictures to my flickr account here: http://www.flickr.com/photos/kiyanwang/

WWW2008: Day 2 – LDOW2008 Workshop

We spent all of yesterday in the Linked Data on the Web Workshop. It was quite an intense day with 27 different presentations, most of which were paper presentations in addition to a few demo’s. It was an excellent workshop so full credit to everyone who helped organise the event.

The workshop began with some short introductions by Sir Tim Berners-Lee, Chris Bizer and our very own Tom Heath. Both Chris and Tom did a great job chairing the workshop during the day and deserve credit for their efforts. After the introductions we went straight into presentations. I won’t try to describe every talk because there were so many and all of them were very good. I just want to talk about some of the highlights for me during the workshop.

Linked data is the Semantic Web done as it should be, the Web
done as it should be.
       Sir Tim Berners-Lee

For me this single statement by Tim, as part of his introduction to the workshop, captures the importance of the whole Linked Data movement. The vision of the Semantic Web cannot come to fruition unless we have linked data, as Tim pointed out back in 2006:

The Semantic Web isn't just about putting data on the web. 
It is about making links, so that a person or machine can explore 
the web of data.  With linked data, when you have some of it, you 
can find other, related, data.

Unsurprisingly every one of the presentations in the workshop aimed to describe technologies, processes, techniques and examples of linking data together semantically, to help make this vision a reality. There are many obstacles to being able to do this, some of these obstacles are technical but others are social and legal (You can view the workshop schedule here and download all of the papers), and we need to understand them all.

We Talisian’s did a couple of presentations during the workshop. I was originally supposed to present the Semantic Marc paper with Rob, but we only finished the slides the night before and decided it would be easier if he presented it without an interruption to change speakers. This proved to be the right decision since he did an excellent job, and we got some great feedback from many of the attendees.

Paul also did a presentation on Open Data Commons, his presentation was, to my mind, was far more important because I don’t believe the Linked Data community has fully understood why there is a need to license data. His presentation led to an interesting discussion and I was surprised to see that there were some people who did not understand why this was such and important issue. From what I recall the canon of their argument was that we have thousands of mashups out there re-using and re-mixing data at the moment so why do we need a Open Data Commons? RDF Book Mashup was cited as an example. What amused me was that it’s widely accepted that the RDF Book Mashup violates Amazon’s Terms of Use. Those arguing against the need for Open Data Commons were seemingly confusing that with the fact that so far Amazon hadn’t chose to do anything about RDF Book Mashup. This misses a fundamental point, Amazon doesn’t necessarily care about what people do with mashups because these are not commercial products. If someone took RDF Book Mashup and used to deliver a rival service to Amazon, I suspect that Amazon would act, and they would be well within their right to do so. Open Data Commons provides protection for Data Providers by giving them a mechanism, like the various OS Licenses did for the Open Source Community, to state under what terms people can use their data. Reciprocally it provides protection for those consuming the data since we know the terms under which the data has been made available to us. This notion that just because existing data providers haven’t sought remedial action against those that abuse their terms of service, we don’t have to worry about anything and don’t need the protection that Open Data Commons provides is naive at best and at worst it could cause the kind of damage that would make it very difficult to create this web of linked data. I guess the linked data community needs to mature in the same way the Open Source community did and also the Creative Commons community did.

One of the major themes that across in many of the talks, which was also a central theme in our paper, is how to handle disambiguation. There were a number of presentations that touched upon this issue most memorably the presentations by Alexander Passant on The Meaning of Tags, and Affraz Jaffri’ presentation on Uri Disambiguation.

I was also impressed by Jun Zhao presentation on Provenance and Linked Data in Biological Datawebs, I was fortunate enough to visit HP Labs last year and see Graham Klyne, one of her colleagues, present some of their work and it’s great to see how well they are doing.

I was also impressed by some of the work that Christian Becker has been doing with Chris Bizer on DBPedia Mobile a location centric DBPedia client application that uses a really cool Fresnal based Linked Data browser. Peter Coetzee’s work on SPARQPlug was also very impressive, and I’ve made a mental note to have a play with it as soon as I get back to the UK.

I could carry on and on but I think it’s sufficient to say it really was a wonderfully useful workshop, and I thoroughly recommend reading all of the papers that were presented.

Oh and I have just realised that Rob has posted up his thoughts about the workshop here.

WWW2008 Day One

Yesterday was our first full day in Beijing, the morning was spent transferring our things from one hotel to the actual conference hotel, by the time we were checked in, it was time for lunch. After lunch we attended a couple of the tutorials. I’ll be posting up details of the tutorials and talks I attend during the conference over on our Nodalities blog

I’ve also uploaded the first set of photos I’ve taken to my flickr account which I have organised into this set, enjoy!.