CS177-lecture-20220127

Exploring data #

Ultimately, any machine learning is just optimizing error in a graph, then deriving a function that fits to the data.

image_2022-01-27-16-32-28 image_2022-01-27-16-32-35 image_2022-01-27-16-39-26

Read more about John Tukey

Iris sample data set #

image_2022-01-27-16-41-05

image_2022-01-27-16-41-42

Statistics definitions #

image_2022-01-27-16-41-55 image_2022-01-27-16-41-59 image_2022-01-27-16-43-22

  • ordinal means that the data can be ordered
  • continuous means that the values are floating point

image_2022-01-27-16-44-38 image_2022-01-27-16-44-42

  • \( \text{AAD} (x) \) is the absolute distance
  • \( \text{MAD} (x) \) is the median dependent value

Visualization #

image_2022-01-27-16-44-51 image_2022-01-27-16-53-19 image_2022-01-27-16-55-58 image_2022-01-27-16-57-51 image_2022-01-27-17-01-01 image_2022-01-27-17-02-12 image_2022-01-27-17-07-18 image_2022-01-27-17-07-23 image_2022-01-27-17-07-27 image_2022-01-27-17-10-15 image_2022-01-27-17-10-20 image_2022-01-27-17-12-42 image_2022-01-27-17-12-48 image_2022-01-27-17-15-50 image_2022-01-27-17-15-54