AP Computer Science Principles Unit 2 Chapter 2

Data

the raw, unorganized facts that need to be processed

Information

data that is processed, organized, structured, or presented in a given context to make it useful

Hypothesis

a proposed explanation for some phenomenon used as the basis for further investigation

Trending

an online topic that is quickly growing in popularity

Digital Divide

the variation in access or use of technology by various demographic characteristics (e.g., race, income, education, age, disability, and/or geography)

Visualization

data provided in a graphical format to facilitate understanding or to communicate a message (i.e. translate data into useful information)

Computational Tool

a computer-based tool or program used to create a computational artifact (e.g., a visualization, a graphic, a video, a program, an audio recording)

README

a document providing background information about a dataset

CSV (comma-separated values)

a widely used format for storing data

Raw data

the original data as it was collected

Cleaning data

making data ready for computational analysis which can include correcting or deleting invalid values and categorizing free-text data

Aggregation

A computation in which rows from a data set are grouped together and used to compute a single value of more significant meaning or measurement. Common aggregations include: average, count, sum, max, median, etc.

Pivot table

in most spreadsheet software it is the name of the tool used to create summary tables

Summary Table

a table that shows the results of aggregations (groupings of items) performed on data from a larger data set, hence a "summary" of larger data. Spreadsheet software typically calls them "pivot tables

metadata

data that describes other data

scalability

the capability of a system to expand to handle a growing amount of work