CS231N-CNN Architectures

Common used architectures

AlexNet:
Imgur

VGG16:
Imgur

VGG19:
Imgur

GoogleNet:
Imgur

Inception:
Imgur

Performance of CNN models:
Imgur
The deeper model performs worse, but it’s not caused by overfitting.

Hypothsis: the problem is an optimization problem, deeper models are harder to optimize.

Solution: ResNet.

ResNet:
Imgur

Residual block:
Imgur

Complexity

Imgur
The best in the left: Inception-v4: ResNet+Inception
For the right:

  • VGG:Highest memory, most operations
  • GoogleNet: Most efficient
  • AlexNet: Smaller compute, still momory heavy, lower accuracy
  • ResNet: Moderate efficiency depending on model, highest accuracy

Forward pass time and power consumption

Imgur

Reference

http://cs231n.stanford.edu/slides/2018/cs231n_2018_lecture09.pdf