Is there any possible way to bulk load data using MLCP as a scheduled task in Marklogic
相关问题
- Exception while copying data using MLCP
- How can I optimize a SPARQL query that returns opt
- ms-excel compatible csv file representing all docu
- QueryOptionsBuilder Deprecation
- How to list all the URI of all documents in a dire
相关文章
- How to use Group By in Marklogic?
- Forgot my admin username and password in MarkLogic
- MARKLOGIC: Is it possible to use more than 1 colum
- Unable to create MarkLogic scheduled tasks from wi
- avoiding XDMP-EXPNTREECACHEFULL and loading docume
- Using Transform Module during MLCP Ingestion to Ma
- Updating counter in XQuery
- 地理空间查询与光学API(MarkLogic)(Geospatial Queries with Op
Scheduled tasks inside
MarkLogic
can call external services (usingHTTP
), but they don't have a way to run an external command. You do have some options:MLCP
job externally, using cron on Linux or something along those lines;JavaScript
orXQuery
; you can retrieve data from a file system, run it through some transforms, and insert it into the database using modules running inMarkLogic
;HTTP
request to that server and have the Java app server callMLCP
I think I'd start with the first option, but which one is best depends on your use case.
You can't invoke mlcp via a scheduled task; I recommend trying something like Apache Camel for this.
Camel has a Timer component and a Quartz component, either of which can be used for scheduling.
And here's an example Camel file with a route (commented out, but still operable) that is initiated by a Timer which then writes a file to disk and ingests it via mlcp - https://github.com/rjrudin/ml-camel-client/blob/master/src/main/resources/META-INF/camel-routes.xml .
I've had good success with doing all kinds of processing/scheduling in Camel and then ultimately ingesting content via mlcp. I think it's a good fit for your use case here so you can leverage what mlcp does best - get content into MarkLogic as fast as possible.