dandy.calculator.calculations.llm.model_calculations
QuantizationBitSizes = Literal[64, 32, 16, 8, 6, 5, 4, 3, 2]
module-attribute
hidden_dimension_and_number_of_layers_from_parameter_count
dandy/calculator/calculations/llm/model_calculations.py
key_value_cache_per_token_size_bytes_calculation
dandy/calculator/calculations/llm/model_calculations.py
key_value_cache_size_bytes_calculation
dandy/calculator/calculations/llm/model_calculations.py
model_inference_activation_size_bytes_calculation
model_size_bytes_calculation