Skip to content

Deep Learning

Status: 🚧 Coming soon β€” chapters are being written.

Deep learning uses neural networks with many layers to learn representations from data. It's behind modern computer vision, NLP, speech, and the LLMs you've been studying in the GEN AI section.

What this section will cover

  • Neural network fundamentals β€” neurons, layers, activations, backprop
  • Frameworks β€” PyTorch (primary), TensorFlow/Keras for comparison
  • Optimization β€” SGD, momentum, Adam, learning-rate schedules
  • Regularization β€” dropout, batch norm, weight decay, data augmentation
  • Architectures β€”
    • CNNs for vision
    • RNNs / LSTMs / GRUs for sequences
    • Transformers (the architecture behind LLMs)
  • Transfer learning, fine-tuning, parameter-efficient fine-tuning (LoRA / QLoRA)
  • Training tricks: mixed precision, gradient accumulation, distributed training

Deep-learning-adjacent context lives in these existing sections:

A dedicated deep-learning track (PyTorch + transformers) is next.