Original Paper - GradientBased Learning Applied to Document Recognition (1998) Related Video
Some other useful links