A nice outline of metadata in HTML including SHOE, microformats, eRDF, and RDFa in the context of searching based on annotated Web content is available at ReadWriteWeb: Making the Web Searchable: The Story of SearchMonkey by Alex Iskold.
Archive for May 2008
I’ve written some thoughts on Yahoo’s SearchMonkey after having seen it live:
Yahoo recently announced SearchMonkey, and for the first time in 10 years, I have a reason to switch search engines, from Google to Yahoo (In fact, I just did that in Firefox.) Most web-savvy engineers know that online services succeed in big ways when they become platforms: when other developers can expand on the functionality in ways not foreseen by the original developers. Yahoo is the first to figure out how to do just that with a major search engine. With SearchMonkey, any developer gains the ability to provide custom ways of extracting and presenting page data within Yahoo search results.
[...]
And the best part is that Yahoo has separated data extraction from data presentation. Specifically, a data extractor for LinkedIn can produce RDFa, and different presentation applications can use different portions of that RDFa. If the extractor for Monster.com produces the same RDFa, then the same presentation application can be used to display both Monster.com and LinkedIn data. And if LinkedIn and Monster.com produce RDFa natively, within their web sites, then there’s no need to build data extraction… the presentation application can work natively on the raw web pages.
Check out SearchMonkey.
There was lots of talk about RDFa at the XTech conference. Amongst the attendees was Ian Forrester of BBC Backstage, who recorded some talks, and interviewed some attendees, including:
- Steven Pemberton’s talk Why You Should Have a Website, which covers reasons for needing RDFa
- An interview with Mark Birbeck on XForms and RDFa
- An interview with Steven Pemberton and Michael Smith on directions the Web is taking, including discussions on RDFa, in two parts: part 1 and part 2.
Digg’s RDFa support is covered in BetaNews. Bob DuCharme is quoted, and the RDFa highlighter is referenced.
At the Xtech conference next week, Steven Pemberton will be presenting his paper “Why you should have a website”.
The talk discusses some of the inherent problems with Web 2.0, and how RDFa can be used to overcome them.
The London Gazette is the UK Government’s Official Journal and Newspaper of Record, published since 1665. A new issue is published every working day with 300-500 official notices.
At her talk SemWebbing the London Gazette at the forthcoming XTech conference, Jeni Tennison will explain how the London Gazette is adopting RDFa to semantically mark up their pages.
