This diagram shows the data science process.
- Data is collected from sensors in the environment, represented by the globe.
- Data is "cleaned" or otherwise processed to produce a data set (typically a data table) usable for processing.
- Exploratory data analysis and statistical modeling may then be performed.
- A "data product" is a program such as retailers use to suggest new purchases based on purchase history. It can also create data and feed it back into the environment.