Distributed Machine Learning – Part 1: Hardware
Machine learning models vary in size from a few parameters to hundreds of billions of parameters (e.g. GPT-3, with 175 billion). Training data ranges from a few hundred rows to millions of rows. Training a model on a single CPU is not always efficient, so people started using GPUs.

GPU vs CPU vs GPGPU

Wait a …