AWS Certified AI Practitioner AIF-C01 Practice Question
Transformer-based large language models process an entire sequence of tokens at once instead of one token at a time. Which architectural feature enables this parallel computation?
The correct answer is self-attention. Transformers replace the recurrence found in RNNs with self-attention layers, which compute relationships between every pair of tokens simultaneously, so the whole sequence can be processed in parallel on GPU hardware. The other options do not provide this capability: gradient clipping is a training safeguard, LSTM cells belong to recurrent networks, and data parallelism is a high-level training strategy rather than an internal model component. None of these alternatives provides the core parallel-sequence processing that distinguishes transformer LLMs.
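To make the idea concrete, here is a minimal NumPy sketch of scaled dot-product self-attention (not part of the original question; the matrix sizes and variable names are illustrative assumptions). The single Q @ K.T product scores every pair of tokens in one matrix multiplication, which is what lets the whole sequence be processed at once instead of stepping token by token as an RNN must.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    # Project every token's embedding into queries, keys, and values in one shot.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    # scores[i, j] relates token i to token j -- all pairs at once, no recurrence.
    scores = (Q @ K.T) / np.sqrt(K.shape[-1])
    # Row-wise softmax turns scores into attention weights.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)
    # Each output row is a weighted mix of every token's value vector.
    return weights @ V

rng = np.random.default_rng(0)
seq_len, d_model, d_head = 5, 8, 4          # toy sizes, chosen only for illustration
X = rng.normal(size=(seq_len, d_model))     # stand-in for token embeddings
Wq, Wk, Wv = (rng.normal(size=(d_model, d_head)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)  # (5, 4): one output per token, computed together
```

Because every step above is a dense matrix operation over the full sequence, it maps directly onto GPU hardware; an LSTM, by contrast, must wait for the hidden state of token i before it can compute token i+1.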
Exam domain: Fundamentals of Generative AI