Posted on March 28, 2010 at 20:35h.
By popular demand, I’m going to try to describe what I did to reach an indexing rate of 1111 docs/sec on an index of 1,200,000 documents. Please don’t be surprised by how little I had to do with the entire story, and how much is thanks to Solr’s great code.
The Machine
First and foremost, small part [...]
Posted on March 16, 2010 at 05:41h.
Solr and Lucene are truly amazing things, capable of indexing and querying vast amounts of data quickly. However, when coming from a conventional database background, it’s quite hard to get used to the way of thinking Lucene requires compared to, for example, SQL.
SQL: select fieldname from database
The equivalent of select fieldname from database, as known in SQL databases, [...]
Posted on February 10, 2010 at 14:32h.
My internship: Global search @ ugent.be
I’m lagging a bit behind, but I’m going to describe my internship. In a nutshell, my internship is about search, a whole lot of search. Since the portal site of the university at Ghent (http://www.ugent.be) is moving to and running Plone as its main CMS and a whole lot of data [...]