
Computer Vision

Teaching machines to interpret images and video — from recognition to generation.

Computer Vision is one of the core areas on AI University's map of AI. Explore the diagram, then dive into each topic; every subtopic will grow into its own deep dive over time.

flowchart TB
  IMG[/Image or Video/] --> CL[Classify]
  IMG --> DET[Detect]
  IMG --> SEG[Segment]
  IMG --> GENV[Generate]
  IMG --> D3[3D / Scene]

Key topics

  • Image classification


    Assign labels to images; the task whose ImageNet benchmark results (AlexNet, 2012) kicked off the deep-learning era.

  • Object detection & segmentation


    Locate and outline objects (bounding boxes, pixel masks).

  • Image & video generation


    Create visual content with diffusion and transformer models.

  • 3D & scene understanding


    Depth and pose estimation, neural radiance fields, and reconstruction of the world from images.

  • Multimodal vision-language


    Models that jointly understand images and text (captioning, visual question answering).
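
To make the bounding boxes and pixel masks mentioned under object detection & segmentation more concrete, here is a minimal sketch of intersection over union (IoU), a common way to score how well a predicted box or mask overlaps the true one. The function names `box_iou` and `mask_iou`, the `(x_min, y_min, x_max, y_max)` box layout, and the nested-list mask layout are illustrative assumptions, not part of any particular library.

```python
from typing import Sequence, Tuple

# Assumed box convention for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def box_iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned bounding boxes."""
    # Width and height of the overlap rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def mask_iou(a: Sequence[Sequence[int]], b: Sequence[Sequence[int]]) -> float:
    """Intersection over union of two binary pixel masks of the same shape."""
    inter = 0
    union = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            if pa and pb:
                inter += 1  # pixel is set in both masks
            if pa or pb:
                union += 1  # pixel is set in at least one mask
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    # Two 2x2 boxes overlapping in a 1x1 square.
    print(box_iou((0, 0, 2, 2), (1, 1, 3, 3)))
    # Two small masks sharing one pixel.
    print(mask_iou([[1, 1], [0, 0]], [[1, 0], [1, 0]]))
```

An IoU of 1.0 means a perfect match and 0.0 means no overlap. Detection benchmarks often count a prediction as correct when its IoU with the ground truth exceeds a chosen threshold such as 0.5.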

Deep Learning · Generative AI · Robotics & Embodied AI

