This post originated from an RSS feed registered with Ruby Buzz
by James Britt.
Original Post: Rollyo Search Stuff
Feed Title: James Britt: Ruby Development
Feed URL: http://feeds.feedburner.com/JamesBritt-Home
Feed Description: James Britt: Playing with better toys
Came across a site called Rollyo, which allows you to create site-specfic search engines. I whipped one up for ruby-doc, defined to search ruby-doc.org and a small set of other core-Ruby sites tat likely have good documentation resources.
I wondered if I could use this to power a search engine on ruby-doc.org itself; the Rollyo FAQ didn't say much on that. Inspired by the page's DOCTYPE of XHTML 1.0 Strict, my first quick stab was to use open-uri and REXML, but, sadly, this does not work.
So, I think I'll use Mechanize, which does a very good job of parsing bad markup.
BTW, although the site makes use of DHTML and remote scripting, one can create Rollyo search URLs much as one can for Google. You need to know the sid of the Rollyo search config you want (ruby-doc is 1526), and pass that as well as the actual search terms in the parameter 'q':