Length of parsed input (4097) exceeds the maximum number of characters defined in your parser settings (4096)

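Problem Description: The Sample Data/Schema crawl option for a table on a CSV source fails in the user interface with the following error:

Length of parsed input (4097) exceeds the maximum number of characters defined in your parser settings (4096)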

Root cause: This is a limitation of the Univocity parser that Infoworks uses to parse the CSV file during the Sample Data run. The maximum number of characters per column value is set to 4096 by default, so any column value longer than that triggers this error.

Solution:

a) Log in to the Infoworks edge node as the Infoworks user and add the following configuration to the conf.properties file to increase this limit (this configuration is not present in the file by default).

csv_max_column_char_size=15000

Choose the value based on the maximum length of the string column values in the source file. In my case, I used an Excel function to identify the maximum length of a string column value.

b) source /opt/infoworks/bin/env.sh

c) Restart the Ingestion service.

/opt/infoworks/bin/stop.sh ingestion && /opt/infoworks/bin/start.sh ingestion

d) Go back to the table and click on Sample Data. The sample data should now be displayed.
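For step a), if Excel is not convenient, the longest value per column can also be measured with a short script. The following is a minimal sketch, not part of Infoworks: the function name max_field_lengths and the inline sample data are hypothetical, and it assumes a comma-delimited file with a header row. Python counts Unicode code points, which may differ slightly from the parser's character count for some non-ASCII text, so leave some headroom when setting csv_max_column_char_size.

```python
import csv
import io


def max_field_lengths(lines, has_header=True):
    """Return a dict mapping each column (name or index) to its longest value length."""
    # Raise Python's own per-field limit so very long values can be read at all.
    csv.field_size_limit(10**9)
    reader = csv.reader(lines)
    header = next(reader, None) if has_header else None
    longest = {}
    for row in reader:
        for i, value in enumerate(row):
            # Fall back to the column index when there is no usable header name.
            key = header[i] if header and i < len(header) else i
            longest[key] = max(longest.get(key, 0), len(value))
    return longest


# Hypothetical sample data standing in for the real source file.
sample = io.StringIO('id,comment\n1,short\n2,"a much longer, quoted value"\n')
lengths = max_field_lengths(sample)
print(lengths)
print("longest value overall:", max(lengths.values()))
```

For a real file, pass an open file handle instead of the sample, for example open(path, newline="", encoding="utf-8"), where path points to the CSV source. Set csv_max_column_char_size to a value comfortably above the largest length reported.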

Applicable IWX Versions:

v4.x, v5.x
