Structured file ingestion is failing with java.lang.IllegalArgumentException Delimiter cannot be more than one character

Structured file ingestion is failing with java.lang.IllegalArgumentException: Delimiter cannot be more than one character

Problem Description:

Structured file ingestion failing with java.lang.IllegalArgumentException: Delimiter cannot be more than one character error, sample stack trace looks like below,

Root cause:

Spark 2 doesn't support multi-character delimiter during CSV read. The databricks runtime version(5.5) we use for the submission of the job has spark version 2.X. So by default files with multi-character delimiter will fail with the below-mentioned error.

Solution:

Spark 3 can handle multi-character delimiter so if we submit the with databricks runtime 7.2X we can avoid above mentioned error while crawling the data. Below is the advanced configuration one needs to set at the table or the source level to run an ingestion job on different runtime than the default one.

Applicable IWX versions:

IWX 4.2

Last updated on

Was this page helpful?