Introduction to Machine Learning
ï‚· Machine Learning: the study of computational mechanisms that "learn" from data in order to
make predictions and decisions
o Statistical data-driven Computation Models
o Real domains (vision,speech behavior):
ï‚· No E=MC^2
ï‚· Noisy, complex, nonlinear
ï‚· Have many variables
ï‚· Non-deterministic
ï‚· Incomplete, approximate models
o Need:statistical models driven by data & sensors
o Bottom-up: use data to form a model
o Why? Complex data everywhere, audio, video, internet
o Intelligence = Learning = Prediction
o Statistician: Breiman, industry learning, very efficient
ï‚· Machine Learning Tacks
o Supervised: algorithms where we have the answers in advanced and making forecasts for
future data (a known relationship/function). The learning part happens where the results
can be compared to with expected values.
ï‚· Classification
ï‚§ ???? > 0
ï‚· Regression
ï‚§ ???? = ????
o Unsupervised: algorithms don't know in advanced the labels/clusters/relevant features.
Exploring what we see and figure out what information we have.
ï‚· Modeling/Structuring
ï‚§ ???? ????
ï‚§ Representing data, help organize data
ï‚· Clustering
ï‚§ Separating into common characteristics
ï‚§ Find what the groups are and the similar features
ï‚· Feature Selection
ï‚§ Extracting most relevant features
ï‚· Detection
ï‚§ ???? ???? < ????
ï‚§ Below a certain threshold
ï‚· Machine Learning Applications
o Interdisciplinary (CS, Math, Stats, Physics, OR, Psych)
o Data-driven approach to AI
o Many domains are too hard to do manually
o For example (any type of large data sets):