Hi Anjana,

Thank you for coming with this practical question. I have the answer for you.

“When your data is first loaded into the Transformer page, a sample of the data in your dataset is displayed in the data grid.”

“When a dataset is first created, a background job begins to generate a sample using the first set of rows of the dataset. This initial sample is usually very quick to generate, so that you can get to work right away on your transformations.”

The data you got in DataPrep table is only a small sample of source data when we ingest datasets from BigQuery to DataPrep. We need to submit it as jobs after you editing recipes. The output will consist the complete data set which could be saved as a new table in BigQuery directly.

By default, the sample is the first set of rows of your dataset. You also can generate a random selection of a subset of rows in the dataset. More details can be found Sampling and Profiling Data.

Thank you for keeping learning.

Good job!

Your AI servant,

Alina

Data Scientist: Keep it simple.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store