Transformation script workflow

At a high level, writing a transformation script and applying it to your data involves the following steps:

  1. Write a custom transformation using custom transform functions within Big Data Discovery, or native Groovy language.
  2. Use preview mode to debug your transformation and view its effects on your data.
  3. Save the transformation to your transformation script.
  4. Edit your transformation script by rearranging, modifying, and deleting individual transformations.
  5. Apply your script to the sample data set your project was created from. This updates your copy of the project data set (it is a sample of the source Hive table), and makes it available in Discover area of Studio, where you can use guided navigation and search on it, as on any other data set in your project.
  6. Apply your script to the source Hive table your project is based on. This creates a new Hive table and adds a new data set to the Catalog in Studio.