Near-optimal Monitoring of Online Data Sources

Google TechTalks
July 27, 2006

Ryan Peterson

ABSTRACT
Crawling the Web for interesting and relevant changes has become increasingly difficult due to the abundance of frequently changing information. Common techniques for solving such problems make use of heuristics, which do not provide performance guarantees and tend to be tailored to specific scenarios or benchmarks.

In this talk, I will present a principled approach based on mathematical optimization for monitoring high-volume online data sources. We have built and deployed a distributed system called Corona that enables clients to subscribe to Web pages and notifies clients of updates asynchronously via instant messages. Corona assigns…

Gmail: A Behind the Scenes Video

http://mail.google.com/mvideo

The final video is now live! Check it out at http://mail.google.com/mvideo

Help us imagine how an email message travels around the world. Take a look at the collaborative video we started, and then film what happens next. Post your clip as a response to this one. We’ll edit a selection of submissions together to make a final video, which will be featured on the Gmail homepage and seen by users worldwide.

The 4 a.m. mystery | Rives

http://www.ted.com Poet Rives does 8 minutes of lyrical origami, folding history into a series of coincidences surrounding that most surreal of hours, 4 o’clock in the morning.

TEDTalks is a daily video podcast of the best talks and performances from the TED Conference, where the world’s leading thinkers and doers are invited to give the talk of their lives in 18 minutes — including speakers such as Jill Bolte Taylor, Sir Ken Robinson, Hans Rosling, Al Gore and Arthur Benjamin. TED stands for Technology, Entertainment, and Design, and TEDTalks cover these topics as well as science, business, politics and the arts. Watch the Top 10 TEDTalks on TED.com, at
http://www.ted.com/index.php/talks/top10