will Gremlin graph queries always perform operatio

2019-09-08 04:19发布

admittedly, most of my database experience is relational. one of the tenets in that space is to avoid moving data over the network. this manifests by using something like:

select * from person order by last_name limit 10

which will presumably order and limit within the database engine vs using something like:

select * from person

and subsequently ordering and taking the top 10 at the client which could have disastrous effects if there are a million person records.

so, with Gremlin (from Groovy), if i do something like:

g.V().has('@class', 'Person').order{println('!'); it.a.last_name <=> it.b.last_name}[0..9]

i am seeing the ! printed, so i am assuming that this bringing all Person records into the address space of my client prior to the order and limit steps which is not the desired effect.

do my options for processing queries entirely in the database engine become product specific (e.g. for orient-db perhaps submit the query in their flavor of SQL), or is there something about Gremlin that i am missing?

标签： neo4j orientdb graph-databases gremlin gremlin-server

1条回答

倾城　Initia

2楼-- · 2019-09-08 04:54

If you want the implementer's query optimizer to kick in, you need to use as many Gremlin steps as possible and avoid pure Groovy/in-memory processing of your graph traversals.

You're most likely looking for something like this (as of TinkerPop v3.2.0):

g.V().has('@class', 'Person').order().by('last_name', incr).limit(10)

If you find yourself using lambdas, chances are often high that this could be done with pure Gremlin steps. Favor Gremlin steps over lambdas.

See TinkerPop v3.2.0 documentation:

0人赞添加讨论(0) 举报

will Gremlin graph queries always perform operatio

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间