A valid character for ingest must be a character according to the XML specification.
See the Second Edition of the XML 1.0 Specification for details about valid characters.
Error: Character <c> is not legal in XML 1.0
The error message is added to the log for the run.
Only the record that includes the invalid character is rejected. The rest of the ingest operation continues.
//#CTL2 // Transforms input record into output record. function integer transform() { string regex = "([^\\u0009\\u000a\\u000d\\u0020-\\uD7FF\\uE000-\\uFFFD]|[\\u0092\\u007F]+)"; $0.YourDataCleanData = replace($YourDatawithInvalidPattern,regex,""); return ALL; }
Compatibility characters are also not valid. The code above removes compatibility characters.