"Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc."
New search engine. Shows mini versions of results pages, and related terms along the left. Interesting, but my blog doesn't show up on the first page of hits for frbr, so naturally I'm suspicious of its usefulness.
"MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the underly