There is no contradiction.
Sorry, friend, you need to read more thoroughly. I am familiar with that article. It clearly states that the 4k cluster size aims for limiting slack space. It also agrees that larger clusters improve speed. I was suggesting down to 4k sizes if you are mostly using small files. If you mostly use large files there is no benefit in small cluster sizes i.e. you might as well user larger clusters to maximize performance.
Then there are the other limitations I mentioned. For instance, you may want to share a card between camera and PDA but the camera does not support FAT32. One of my cameras, the Kyocera, is like that.