How to select specific columns through Spark JDBC?

Published 2019-07-28 16:00

Question:

I am using Spark to connect to my Oracle database. However, one of the columns has the type "TIMESTAMP WITH TIME ZONE", which is Oracle-specific. When I load data from a table that contains a column of this type, it throws the error "java.sql.SQLException: Unsupported type -101".

Does anybody know how to load only specific columns from a table? That way I can avoid selecting the "TIMESTAMP WITH TIME ZONE" column. It would be even better if someone could explain the "java.sql.SQLException: Unsupported type -101" error, though I suspect it may be a Spark bug.

My code is below; thanks a lot.

from pyspark.sql import SparkSession

spark = SparkSession\
    .builder\
    .appName("TestSQL")\
    .getOrCreate()

# Reading the whole table fails with "Unsupported type -101" because it
# contains a TIMESTAMP WITH TIME ZONE column
orc = spark.read \
    .format("jdbc") \
    .option("url", "jdbc:oracle:thin:xxx/xxx@IP:1521/database") \
    .option("dbtable", "xxx.xxx") \
    .load()

Answer 1:

In the options, you can pass a SQL query via the dbtable key, and in that query you can select only the required columns.

For example:

// The parenthesized query runs on the database side, so only the
// selected columns are fetched
final String dbTable =
        "(select emp_no, concat_ws(' ', first_name, last_name) as full_name from employees) as employees_name";

// emp_no serves as the partition column: 10 partitions over the given bounds
Dataset<Row> jdbcDF =
        sparkSession.read().jdbc(CONNECTION_URL, dbTable, "emp_no", 10001, 499999, 10, connectionProperties);

*The code above is in Java.

Source: Loading database data using Spark 2.0 Data Sources API
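
Since the question uses PySpark, here is the same idea adapted to the asker's setup. This is a minimal sketch: the column names in the subquery are placeholders, and the subquery must be valid Oracle SQL because it is executed by the database, not by Spark.

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("TestSQL").getOrCreate()

# Wrap a column-selecting query in parentheses and give it an alias;
# Oracle executes the subquery, so the TIMESTAMP WITH TIME ZONE column
# is never fetched. "id" and "name" are placeholder column names.
query = "(SELECT id, name FROM xxx.xxx) t"

orc = spark.read \
    .format("jdbc") \
    .option("url", "jdbc:oracle:thin:xxx/xxx@IP:1521/database") \
    .option("dbtable", query) \
    .load()

If you still need the timestamp values, you can cast the column to a supported type inside the subquery, e.g. CAST(ts_col AS TIMESTAMP), instead of dropping it.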



Answer 2:

Another approach is to create views in Oracle, so that Oracle-specific data types are handled within the database itself.
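
A minimal sketch of this approach, with an illustrative view name and column names: the view casts the TIMESTAMP WITH TIME ZONE column to a plain TIMESTAMP, so Spark's JDBC type mapping never sees the unsupported type.

# On the Oracle side (illustrative DDL; my_table_v and ts_col are
# placeholder names):
#
#   CREATE VIEW my_table_v AS
#   SELECT id, name, CAST(ts_col AS TIMESTAMP) AS ts_col
#   FROM xxx.xxx;

from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("TestSQL").getOrCreate()

# Spark then reads the view like any ordinary table
df = spark.read \
    .format("jdbc") \
    .option("url", "jdbc:oracle:thin:xxx/xxx@IP:1521/database") \
    .option("dbtable", "my_table_v") \
    .load()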