Consume energy efficient models
GENSUS03: How do you maintain model efficiency and resource optimization when working with large language models?
Explore strategies for improving model efficiency and resource optimization when working with large language models, focusing on techniques such as quantization, pruning, and fine-tuning smaller models for specific tasks. Consider model distillation, which transfers a large model's capabilities into a compact, task-specific model. Aim to balance performance against computational cost to achieve optimal resource utilization in generative AI applications.
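As an illustration of one technique named above, the sketch below shows symmetric per-tensor int8 quantization in plain Python. This is a simplified didactic example, not a production implementation: real deployments would use per-channel scales, calibration data, and a framework's quantization toolkit. The function names and sample weights are hypothetical.

```python
def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization: map floats to [-128, 127]
    using a single scale derived from the largest absolute weight."""
    scale = max(abs(w) for w in weights) / 127.0
    quantized = [max(-128, min(127, round(w / scale))) for w in weights]
    return quantized, scale

def dequantize_int8(quantized, scale):
    """Recover approximate float weights from int8 values and the scale."""
    return [q * scale for q in quantized]

# Hypothetical weight tensor; int8 storage uses 1 byte per weight
# versus 4 bytes for float32, roughly a 4x memory reduction.
weights = [0.52, -1.27, 0.08, 0.91, -0.33]
quantized, scale = quantize_int8(weights)
recovered = dequantize_int8(quantized, scale)
```

The quantization error introduced by rounding is bounded by half a quantization step (`scale / 2`), which is why low-bit formats trade a small accuracy loss for substantial savings in memory and compute.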