Product Documentation – Mantium

Product Documentation

Find all the code, requirements, and tutorials to ditch the digging and start coding.

Retrieve relevant information from API documentation, coding tutorials, or user manuals, regardless of where it lives or how many times you have forgotten where you put it.

Input

What are the arguments to the text normalization transform?

Output

The arguments to the text normalization transform are:

source_column: The column name containing the text you want to normalize.

target_column: The column name that holds the normalized text.

lower: A boolean value indicating whether to convert the text to lowercase.

normalize_whitespace: A boolean value indicating whether to remove extra whitespace between words.

strip_lines: A boolean value indicating whether to remove leading or trailing whitespace from each line.

keep_two_line_breaks: A boolean value indicating whether to keep at most two line breaks.

no_line_breaks: A boolean value indicating whether to remove line breaks.

clean_whitespace: A boolean value indicating whether to strip whitespace before or after each line.

clean_empty_lines: A boolean value indicating whether to remove more than two empty lines.
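
To make the arguments above more concrete, here is a minimal Python sketch of how such a transform might behave. It is not Mantium's implementation: the function names `normalize_text` and `apply_transform`, the list-of-dicts row format, the default values, and the order in which the flags are applied are all assumptions made for illustration. `clean_whitespace` and `clean_empty_lines` are left out because their descriptions overlap with `strip_lines` and `keep_two_line_breaks`, and their exact behavior is not specified here.

```python
import re


def normalize_text(text, lower=False, normalize_whitespace=False,
                   strip_lines=False, keep_two_line_breaks=False,
                   no_line_breaks=False):
    """Hypothetical normalizer mirroring the flags listed above."""
    if lower:
        text = text.lower()
    if strip_lines:
        # Remove leading and trailing whitespace from each line.
        text = "\n".join(line.strip() for line in text.split("\n"))
    if normalize_whitespace:
        # Collapse runs of spaces or tabs between words; line breaks are kept.
        text = re.sub(r"[ \t]+", " ", text)
    if keep_two_line_breaks:
        # Reduce three or more consecutive line breaks to two.
        text = re.sub(r"\n{3,}", "\n\n", text)
    if no_line_breaks:
        # Replace each line break (and whitespace around it) with one space.
        text = re.sub(r"\s*\n\s*", " ", text).strip()
    return text


def apply_transform(rows, source_column, target_column, **options):
    """Read text from source_column and write the result to target_column.

    `rows` is assumed to be a list of dicts; the input rows are not mutated.
    """
    return [
        {**row, target_column: normalize_text(row[source_column], **options)}
        for row in rows
    ]


if __name__ == "__main__":
    rows = [{"body": "  Hello   World  \n\n\n\nNext  Line "}]
    cleaned = apply_transform(
        rows,
        source_column="body",
        target_column="body_clean",
        lower=True,
        normalize_whitespace=True,
        strip_lines=True,
        keep_two_line_breaks=True,
    )
    print(repr(cleaned[0]["body_clean"]))
```

Writing to a separate `target_column` keeps the original text available next to the normalized version, which makes it easy to compare the two when tuning the flags.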

Data Connectors

COMING SOON

Looking to code more and search less?