All the numerical examples of convolutions that I have seen thus far always assume that the input image has only 1 channel (for example, see this post on stack overflow). This made me wonder how the computations would work when the input image has multiple channels. I created a Sage worksheet to trace how the computations would work. You can get the code I wrote from my blog here.