Transformer is kind of designed for the GPU…we want an architecture that is fundamentally extremely parallelizable.
Share this post
Andrej Karpathy on AI infra of the future…
Share this post
Transformer is kind of designed for the GPU…we want an architecture that is fundamentally extremely parallelizable.