Posted in March 28, 2010 ¬ 20:35h.cpf_Comments Off
On popular demand, I’m going to try and describe what I did to get the 1111 docs/sec indexing (link) on a 1.200.000 documents index. Please don’t be surprised to how little I have to do with the entire story, and how much is thanks to Solr’s great code. The Machine First and foremost, small part [...]
Read the rest of this entry »
Posted in March 16, 2010 ¬ 05:41h.cpf_Comments Off
Solr and Lucene are truly amazing things, capable of fast indexing and querying vast amounts of data. However, when coming from a conventional database structure, it’s quite hard getting to the thinking pattern Lucene uses vs SQL (e.g.) SQL: select fieldname from database The equivalent of select fieldname from database as known in SQL databases, [...]
Read the rest of this entry »