MongoDB: How to index unknown fields

We're storing documents with a undefined structure. I mean, it has a basic structure (id, user and creationTimestamp), but is also there a Map<String, Object> values fields, where we are able to store whichever structure:

public class Metadata {
    private String id;
    private String user;
    private Date creationTimestamp;
    private Map<String, Object> values;
}

Example:

> db.metadata.find();
{
    "_id" : "Doc2Ref2Mdt1",
    "user" : "user1",
    "creationTimestamp" : ISODate("2018-09-24T12:20:56.958Z"),
    "values" : {
        "ambit" : "ctti",
        "departament" : "economia"
    }
},
{
    "_id" : "Doc1Ref2Mdt1",
    "user" : "user2",
    "creationTimestamp" : ISODate("2018-09-24T12:20:56.169Z"),
    "values" : {
        "date" : ISODate("2018-09-24T12:20:56.171Z"),
        "number" : 16,
        "address" : {
        "street" : "Av. Diagonal",
        "location" : "barcelona",
        "credentials" : [
        {
            "password" : "pwd",
            "login" : "main"
        },
        {
            "password" : "pwd",
            "login" : "other",
            "creation" : ISODate("2018-09-24T12:20:56.171Z")
        }],
        "contact" : "contact name",
        "tags" : ["tag1", "tag2"}]
    }
}

So, you can see values can store any structure.

I need to know if mongodb is able to automatically index all of them.

I mean, when a new field is "added" into values, for example, values.newfield it is indexed automatically.

Any ideas?

标签： mongodb mongodb-query mongodb-indexes

2条回答

Lonely孤独者°

2楼-- · 2019-08-23 11:37

Not possible in the way you want it.

You can try text index on all fields followed by the actual query. The text search alone may result with false-positive matches, but aggregated with the normal query it will reduce the dataset to scan and in most cases will speed it up.

There are few limitations to keep in mind:

only string fields will be indexed, e.g. .find({ $text: { $search: "2018-09-24" } }) won't return anything. Neither will .find({ $text: { $search: "16" } })
only full words queries, i.e. no regexps. You will need to use language: "none" to keep stopwords and don't use steming. Queries like .find({ $text: { $search: "barcel" } }) won't find the second document.

The first limitation can be worked around by serialisation of values map to a string field on write, if you control all writes. Then you will need to create a text index on this field only.

0人赞添加讨论(0) 举报

smile是对你的礼貌

3楼-- · 2019-08-23 11:49

you can create an index on a subdocument, and new field will be added automatically,

BUT

To use this index with your queries, you must provide as parameter complete and ordered document.

Example, with your sample :

db.metadata.createIndex({"values",1});

db.metadata.find("values.ambit":"ctti")

==> Will NOT use the index, but will return first document.

db.metadata.find(values:{ambit:"ctti"})

==> Will use the index, but return no document.

db.metadata.find(values:{"departament" : "economia", ambit:"ctti"})

==> Will use the index, but return no document, due to different fields order.

db.metadata.find(values:{ambit:"ctti", "departament" : "economia"})

==> Will use the index, and return return first document.

0人赞添加讨论(0) 举报

MongoDB: How to index unknown fields

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间