How to create a HDInsightOnDemand LinkedService wi

Posted 2019-06-07 23:29

We are creating a DataFactory to run a PySpark job on an HDInsight on-demand cluster.

The problem is that the job needs additional Python dependencies, such as numpy, that are not installed on the cluster.

We believe the way to do this is to configure a Script Action for the HDInsightOnDemandLinkedService, but we cannot find this option in DataFactory or LinkedServices.

Is there another way to automate installing the dependencies on the HDInsightOnDemand cluster?

1 answer
爱情/是我丢掉的垃圾
#2 · 2019-06-08 00:13

Script Actions are currently not supported for HDInsightOnDemandLinkedService. As a workaround, you can use Azure Automation to run a PowerShell script that does the following:

  1. Create the HDInsight cluster.
  2. Execute the Script Action.
  3. Run the pipeline in your DataFactory.
  4. Delete the cluster.
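
The ordering of those four steps can be sketched as follows. This is only an illustration in Python, not the PowerShell runbook itself: `run_with_temporary_cluster` and the four callables are hypothetical placeholders, and in a real runbook each one would wrap the corresponding Azure call (in the Az PowerShell module these would likely be cmdlets such as `New-AzHDInsightCluster`, `Submit-AzHDInsightScriptAction`, `Invoke-AzDataFactoryV2Pipeline` and `Remove-AzHDInsightCluster`; check the current documentation for their parameters). The one design choice shown here, which is my assumption rather than part of the answer, is to delete the cluster in a `finally` block so a failed Script Action or pipeline run does not leave a billed cluster behind.

```python
from typing import Callable, List


def run_with_temporary_cluster(
    create_cluster: Callable[[], str],
    run_script_action: Callable[[str], None],
    run_pipeline: Callable[[str], None],
    delete_cluster: Callable[[str], None],
) -> None:
    """Run the four steps in order, always deleting the cluster at the end.

    All four callables are hypothetical stand-ins for the real Azure calls.
    """
    # Step 1: create the cluster and keep its name for the later steps.
    cluster_name = create_cluster()
    try:
        # Step 2: install the extra Python dependencies (e.g. numpy).
        run_script_action(cluster_name)
        # Step 3: trigger the DataFactory pipeline against this cluster.
        run_pipeline(cluster_name)
    finally:
        # Step 4: delete the cluster even if step 2 or 3 raised.
        delete_cluster(cluster_name)


def dry_run() -> List[str]:
    """Exercise the ordering with stand-in steps that only record calls."""
    calls: List[str] = []

    def create() -> str:
        calls.append("create")
        return "demo-cluster"  # hypothetical cluster name

    run_with_temporary_cluster(
        create_cluster=create,
        run_script_action=lambda name: calls.append("script_action"),
        run_pipeline=lambda name: calls.append("pipeline"),
        delete_cluster=lambda name: calls.append("delete"),
    )
    return calls


if __name__ == "__main__":
    print(dry_run())
```

Since the cluster is created by the script rather than by DataFactory, the pipeline in step 3 would presumably need to reach it through a regular HDInsight linked service instead of the on-demand one.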