● Image Classification
Image classification assigns a label or category to an image based on its visual content. It is a fundamental problem in computer vision and has numerous applications such as object recognition, face detection, and image retrieval.
● Semantic gap, challenges (17:04)
The semantic gap refers to the difference between low-level visual features extracted from an image and high-level semantic concepts that humans associate with them. The challenges in image classification include dealing with variations in lighting, scale, and orientation, recognizing objects under partial occlusion, and distinguishing between objects with similar visual appearances.
Semantic Gap:
(ppt) Challenges: viewpoint variation, intra-class variation, deformation, illumination
● Machine learning: a data-driven approach
The data-driven approach to machine learning involves training a model using a large dataset of labeled examples. The model learns to generalize patterns from the training data and can then be used to predict labels for new, unseen examples.
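This train/predict workflow can be sketched as two functions. A minimal toy sketch — the function names and the trivial "memorize everything" strategy are illustrative, not from the source:

```python
# Minimal sketch of the data-driven API (names are illustrative).

def train(images, labels):
    # Simplest possible "training": memorize all labeled examples.
    return {"images": list(images), "labels": list(labels)}

def predict(model, test_image):
    # Toy prediction: return the label of an exactly matching stored
    # image; fall back to the first stored label otherwise.
    for img, lab in zip(model["images"], model["labels"]):
        if img == test_image:
            return lab
    return model["labels"][0]

model = train(["cat_img", "dog_img"], ["cat", "dog"])
print(predict(model, "dog_img"))  # dog
```

Real classifiers differ only in what `train` extracts from the data and how `predict` generalizes to unseen inputs.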

โ Nearest neighbor classifier
The nearest neighbor classifier is a simple but effective algorithm for image classification. It works by finding the nearest training image(s) to a test image based on some distance metric, and then assigning the label of the nearest training image(s) to the test image.
No learning happens at training time: the classifier simply stores all the training data.
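A small NumPy sketch of a 1-nearest-neighbor classifier, assuming images are flattened into vectors (the L1 distance here is one possible choice of metric):

```python
import numpy as np

class NearestNeighbor:
    """1-nearest-neighbor: training just stores the data."""

    def train(self, X, y):
        # X: (N, D) flattened training images, y: (N,) labels.
        self.X_train = X
        self.y_train = y

    def predict(self, X):
        # Label each test image with the label of the closest
        # training image under the L1 (Manhattan) distance.
        y_pred = np.empty(X.shape[0], dtype=self.y_train.dtype)
        for i in range(X.shape[0]):
            dists = np.sum(np.abs(self.X_train - X[i]), axis=1)
            y_pred[i] = self.y_train[np.argmin(dists)]
        return y_pred
```

Note the asymmetry: training is O(1) (just store), but predicting each test image requires scanning the whole training set.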

Distance Metric
This can easily be wrong, because it only compares images at the pixel (color) level.
Decision boundaries
Pixel-level color differences are not useful for recognizing objects!
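The two standard pixel-space metrics can be written out directly; the 3-"pixel" vectors below are toy values for illustration:

```python
import numpy as np

def l1_distance(a, b):
    # L1 / Manhattan distance: sum of absolute pixel differences.
    return np.sum(np.abs(a - b))

def l2_distance(a, b):
    # L2 / Euclidean distance: sqrt of summed squared differences.
    return np.sqrt(np.sum((a - b) ** 2))

a = np.array([10.0, 20.0, 30.0])  # toy "images" with 3 pixels each
b = np.array([12.0, 18.0, 30.0])
print(l1_distance(a, b))  # 4.0
print(l2_distance(a, b))  # sqrt(8) ≈ 2.83
```

Both operate purely on raw pixel values, which is exactly why they can rank two images of the same object as far apart.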

● Hyperparameters
Hyperparameters are parameters in a machine learning model that are set before training and are not learned from the data. They control the complexity of the model and can have a significant impact on its performance. Examples of hyperparameters in a linear classifier include the regularization strength and the learning rate.


Expensive (cross-validation means many training runs), so it is mostly used on small datasets.
In fact, deep learning also does not work well when there is not much data.
● Linear classifier
A linear classifier is a type of machine learning model that learns to separate data points into different classes using a linear decision boundary. This decision boundary can be represented algebraically, visually, or geometrically.
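The score function is just a matrix-vector multiply plus a bias. A minimal sketch with toy sizes (4 "pixels", 3 classes — the dimensions are illustrative):

```python
import numpy as np

def linear_scores(W, x, b):
    # f(x; W, b) = Wx + b: maps D pixel values to C class scores.
    return W @ x + b

rng = np.random.default_rng(0)
D, C = 4, 3                    # toy sizes: 4 "pixels", 3 classes
W = rng.normal(size=(C, D))    # one row of W per class
b = np.zeros(C)
x = rng.normal(size=D)         # one flattened image
print(linear_scores(W, x, b).shape)  # (3,)
```

The predicted class is simply the index of the highest score.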


● Algebraic, Visual, Geometric viewpoints
Algebraic: f(x; W, b) = Wx + b, a single matrix-vector multiply. Visual: each row of W can be reshaped into a template image for its class. Geometric: each class score defines a hyperplane cutting through pixel space.

● Loss functions: SVM, Softmax
In a linear classifier, the loss function is used to measure how well the model is able to predict the correct class labels. Two commonly used loss functions for linear classifiers are the support vector machine (SVM) loss and the softmax loss. The SVM loss encourages the model to have a large margin between different classes, while the softmax loss is used for multi-class classification problems and produces a probability distribution over all possible classes.
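The multiclass SVM (hinge) loss for a single example can be sketched in a few lines of NumPy; the score values below are illustrative:

```python
import numpy as np

def svm_loss(scores, y, delta=1.0):
    # Multiclass SVM (hinge) loss for one example:
    # sum over incorrect classes j of max(0, s_j - s_y + delta).
    margins = np.maximum(0.0, scores - scores[y] + delta)
    margins[y] = 0.0  # skip the ground-truth class
    return np.sum(margins)

scores = np.array([3.2, 5.1, -1.7])  # illustrative scores, correct class 0
print(svm_loss(scores, y=0))  # max(0, 2.9) + max(0, -3.9) = ≈ 2.9
```

The loss is zero once every incorrect class scores at least `delta` below the correct class, which is the "large margin" behavior described above.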


* SVM: only asks that the correct class's score be high (higher than the others by the margin).


A1: no change, because 4.9 − 0.5 is still larger than (1.3 + 1) and (2.0 + 1), so all margins remain satisfied.
A2: min = 0, max = +∞
A3: we loop over the incorrect classes; there are C − 1 of them, and each contributes max(0, 0 − 0 + 1) = 1, so the answer is C − 1.
A4: C (including the correct class just adds a constant 1).
Q: why should we skip the ground-truth class?
A: if it were included, its term would be a constant 1; a minimum loss of 0 signals a correct classification more clearly than 1 does.

A5: using the sum instead of the mean makes no real difference; it only rescales the loss by a constant.

A6: squaring the hinge gives a much bigger loss for bigger errors.
Linear vs. quadratic changes the trade-off:
(are many small errors better than one large error?)
(important)
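The answers above can be checked numerically with a small self-contained hinge-loss sketch (the `skip_correct` flag is added here purely to demonstrate the Q&A point):

```python
import numpy as np

def svm_loss(scores, y, delta=1.0, skip_correct=True):
    # Hinge loss; optionally keep the correct-class term to show
    # why it is normally skipped.
    margins = np.maximum(0.0, scores - scores[y] + delta)
    if skip_correct:
        margins[y] = 0.0
    return np.sum(margins)

C = 10
zero_scores = np.zeros(C)  # e.g., W is tiny at initialization

# A3: with all scores ~0, each of the C-1 wrong classes contributes 1.
print(svm_loss(zero_scores, y=0))                      # 9.0
# A4: keeping the correct class adds a constant 1, giving C.
print(svm_loss(zero_scores, y=0, skip_correct=False))  # 10.0
```

This "loss ≈ C − 1 at initialization" value is a handy sanity check when debugging a training loop.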


HOW to choose a unique W? ⬇️

Prevent the model from doing too well on the training data (overfitting).
SOFTMAX (this loss function is used for multinomial logistic regression / the softmax classifier)
(The SVM loss only works with scalar score values and only asks that the correct class score be high, which is not very interpretable.)
(We want to interpret raw classifier scores as probabilities.)
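A sketch of the softmax (cross-entropy) loss for one example; the max-shift is the standard numerical-stability trick, and the scores are illustrative:

```python
import numpy as np

def softmax_loss(scores, y):
    # Interpret raw scores as probabilities via softmax, then take
    # -log of the true class's probability (cross-entropy).
    shifted = scores - np.max(scores)   # stabilizes exp; same probs
    probs = np.exp(shifted) / np.sum(np.exp(shifted))
    return -np.log(probs[y])

scores = np.array([3.2, 5.1, -1.7])  # illustrative scores
print(softmax_loss(scores, y=0))     # ~2.04: class 0 gets prob ~0.13
```

Unlike the hinge loss, this loss is never exactly zero: the model is always pushed to assign more probability to the correct class.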


A1: min = 0 (−log(1) = 0); max = +∞ (−log(0) → +∞)
A2: at initialization all scores are ≈ 0, so each probability is 1/C and the loss is −log(1/C) = log(C) (a useful debugging sanity check).
Summary:

● Regularization
Regularization is a technique used to prevent overfitting in machine learning models. It involves adding a penalty term to the loss function that encourages the model to have smaller weights. Examples of regularization techniques include L1 and L2 regularization.
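The L2 penalty term can be written directly as a sketch; `lam` is the regularization-strength hyperparameter mentioned earlier, and the weight values are illustrative:

```python
import numpy as np

def l2_penalty(W, lam):
    # L2 regularization: lam * sum of squared weights.
    return lam * np.sum(W ** 2)

def full_loss(data_loss, W, lam):
    # Total objective = data loss (fit) + penalty (simplicity of W).
    return data_loss + l2_penalty(W, lam)

W = np.array([[1.0, -2.0], [0.5, 0.0]])
print(l2_penalty(W, lam=0.1))  # 0.1 * (1 + 4 + 0.25) = 0.525
```

L1 regularization would replace `W ** 2` with `np.abs(W)`, which instead encourages many weights to be exactly zero.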



"Spread out" the weights: meaning L2 regularization prefers that every weight contributes a little, rather than a few weights doing all the work.
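A common numeric illustration of this preference (the specific vectors are a standard toy example, not from the source):

```python
import numpy as np

x  = np.array([1.0, 1.0, 1.0, 1.0])
w1 = np.array([1.0, 0.0, 0.0, 0.0])      # one weight does everything
w2 = np.array([0.25, 0.25, 0.25, 0.25])  # contribution is spread out

# Both weight vectors produce exactly the same score on x...
print(w1 @ x, w2 @ x)                # 1.0 1.0
# ...but the L2 penalty strongly prefers the spread-out w2.
print(np.sum(w1**2), np.sum(w2**2))  # 1.0 0.25
```

Since both weight vectors fit this input equally well, the regularizer alone breaks the tie, pushing the model to use all input dimensions a little rather than one dimension a lot.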