You are viewing a free preview of this lesson.
Subscribe to unlock all 10 lessons in this course and every other course on LearningBro.
Computer vision (CV) and natural language processing (NLP) are two of the most established and impactful domains of artificial intelligence. Computer vision enables machines to interpret visual data, while NLP enables machines to understand and generate human language.
Computer vision gives machines the ability to extract meaningful information from images and video.
| Task | Description | Example |
|---|---|---|
| Image Classification | Assign a label to an image | "This image contains a cat" |
| Object Detection | Locate and classify objects | "Car at (100,50), person at (200,300)" |
| Semantic Segmentation | Classify every pixel | Each pixel labelled as road, car, sky |
| Instance Segmentation | Distinguish individual objects | "Car 1, Car 2, Car 3" |
| Pose Estimation | Detect body keypoints | Skeleton overlay on a person |
| Image Generation | Create images from descriptions | Text-to-image, style transfer |
| OCR | Extract text from images | Reading documents from photographs |
CNN Feature Hierarchy:
Subscribe to continue reading
Get full access to this lesson and all 10 lessons in this course.