Inference Options
SageMaker Serverless Inference automatically scales compute capacity with request traffic and incurs no compute charges while the endpoint is idle (no requests), making it ideal for intermittent traffic.
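As a rough illustration of how a serverless endpoint might be configured, the sketch below builds the request for the SageMaker `create_endpoint_config` API with a `ServerlessConfig` block instead of an instance type and count. The helper names, the default values, and the variant name `AllTraffic` are assumptions made for this example; the memory sizes reflect the 1 GB to 6 GB options as understood at the time of writing and should be checked against the current AWS documentation.

```python
# Memory sizes (MB) assumed to be accepted for serverless endpoints.
ALLOWED_MEMORY_MB = (1024, 2048, 3072, 4096, 5120, 6144)


def build_serverless_endpoint_config(config_name, model_name,
                                     memory_mb=2048, max_concurrency=5):
    """Return keyword arguments for create_endpoint_config (hypothetical helper)."""
    if memory_mb not in ALLOWED_MEMORY_MB:
        raise ValueError(f"memory_mb must be one of {ALLOWED_MEMORY_MB}")
    if max_concurrency < 1:
        raise ValueError("max_concurrency must be at least 1")
    return {
        "EndpointConfigName": config_name,
        "ProductionVariants": [
            {
                "VariantName": "AllTraffic",  # illustrative variant name
                "ModelName": model_name,
                # ServerlessConfig replaces InstanceType / InitialInstanceCount,
                # so no instances are kept running between requests.
                "ServerlessConfig": {
                    "MemorySizeInMB": memory_mb,
                    "MaxConcurrency": max_concurrency,
                },
            }
        ],
    }


def create_serverless_endpoint_config(request):
    """Send the request to SageMaker. Needs boto3 and AWS credentials."""
    import boto3  # imported lazily so the builder works without boto3 installed

    client = boto3.client("sagemaker")
    return client.create_endpoint_config(**request)


if __name__ == "__main__":
    # "recs-serverless-config" and "recs-model" are placeholder names.
    request = build_serverless_endpoint_config("recs-serverless-config", "recs-model")
    print(request)
```

Only `build_serverless_endpoint_config` runs offline; `create_serverless_endpoint_config` makes a real AWS call and assumes a model with the given name already exists in the account.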
Therefore, the correct answer is: SageMaker Serverless Inference.
The option that says: Real-time Inference is incorrect because it keeps instances always running, incurring costs during idle periods.
The option that says: Batch Transform is incorrect because it is for offline batch processing, not real-time recommendations.
The option that says: Asynchronous Inference is incorrect because it queues requests and is designed for large payloads and long processing times, not for minimizing the cost of quick requests that arrive unpredictably.
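The reasoning behind the four options can be condensed into a small rule of thumb. The function below is a simplified sketch of that reasoning only, not an official AWS decision procedure; the function name and its three boolean parameters are hypothetical and chosen for this example.

```python
def choose_inference_option(needs_online_response,
                            large_payload_or_long_processing,
                            traffic_is_intermittent):
    """Map coarse workload traits to a SageMaker inference option (rule of thumb)."""
    if not needs_online_response:
        # Offline scoring of a whole dataset; no persistent endpoint needed.
        return "Batch Transform"
    if large_payload_or_long_processing:
        # Requests are queued and processed asynchronously.
        return "Asynchronous Inference"
    if traffic_is_intermittent:
        # Capacity scales with traffic; no compute charge while idle.
        return "Serverless Inference"
    # Steady traffic justifies instances that are always running.
    return "Real-time Inference"


if __name__ == "__main__":
    # The scenario in this question: online recommendations, small and quick
    # requests, intermittent traffic.
    print(choose_inference_option(True, False, True))
```

Real workloads often involve more factors than these three (cold-start tolerance, payload limits, GPU needs), so the function is best read as a summary of the explanations above rather than a complete selection guide.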