Searching Binary files in marklogic

2019-07-01 18:14发布

Is there any method to search content of binary files like PPT, PDF etc other than

converting it into X HTML format using xdmp:document-filter() function and searching on it ?

1条回答
可以哭但决不认输i
2楼-- · 2019-07-01 18:30

Basically no. You have to pull out the readable text out of the binary format to allow MarkLogic to index it. You can extract that text with xdmp:document-filter() or with functions like xdmp:pdf-convert() and xdmp:word-convert(), but there is no way to index binary nodes directly.

HTH!

查看更多
登录 后发表回答