I am playing around with lucene and 40GB of data (~500M of tuples, 2 fields behaving like key - value). I have created -- a suprise -- a 35 GB index which does not work. Therefore I want to create a set of smaller indicies but, for that, I need information about maximum size.
相关问题
- Delete Messages from a Topic in Apache Kafka
- Jackson Deserialization not calling deserialize on
- How to maintain order of key-value in DataFrame sa
- StackExchange API - Deserialize Date in JSON Respo
- Difference between Types.INTEGER and Types.NULL in
Are you using MMapDirectory and a 32-bit VM? If so, the address space is not enough to cover the whole index and that might have caused the problem. In that case you need to use SimpleFSDirectory or NIOFSDirectory instead. Note that functions like FSDirectory.open(File) return a FSDirectory, which might or might not be a MMapDirectory.
What filesystem do you use? Are you absolutely sure that you have created a valid index? How exactly are you indexing your data?
Theoretically you shouldn't be at the maximum by a long shot.
http://lucene.apache.org/java/3_0_0/fileformats.html#Limitations