Fill the Data

per hour:
or over 3 years:

Break-Even Analysis

% of Reuses/Hours to Break Even:
0.00%
[Chart: Cost Savings ($/3y) vs. % Cache Hits]

Model Parameters

KV Cache Size
Tokens/sec/GPU
Prefill vs CacheBlend Speed Ratio

Storage Calculations

Additional Storage Cost/Hour
Additional Storage Cost/3y
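
The two storage figures above are presumably related by a fixed number of hours. A minimal sketch, assuming the KV cache size is given in GB and storage is billed at a flat price per GB-hour (the page does not state units or a pricing model; `kv_cache_gb` and `price_per_gb_hour` are hypothetical names):

```python
# Sketch only: units and pricing model are assumptions, not taken from the page.
HOURS_PER_3Y = 3 * 365 * 24  # 26,280 hours, ignoring leap days


def additional_storage_cost(kv_cache_gb: float, price_per_gb_hour: float) -> tuple[float, float]:
    """Return (cost per hour, cost over 3 years) for keeping the KV cache in storage."""
    if kv_cache_gb < 0 or price_per_gb_hour < 0:
        raise ValueError("size and price must be non-negative")
    per_hour = kv_cache_gb * price_per_gb_hour
    per_3y = per_hour * HOURS_PER_3Y
    return per_hour, per_3y


if __name__ == "__main__":
    hourly, three_year = additional_storage_cost(kv_cache_gb=1000, price_per_gb_hour=0.0001)
    print(f"Additional Storage Cost/Hour: ${hourly:.4f}")
    print(f"Additional Storage Cost/3y:   ${three_year:.2f}")
```

If the storage price is quoted per GB-month instead, divide it by the hours in a month before passing it in.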

Processing
Calculations

Number of Tokens Cached
GPU Hours to Prefill
GPU Cost to Prefill
GPU Cost with Cache
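
The processing quantities above and the break-even percentage can be tied together in a few lines. This is a sketch under stated assumptions, not the calculator's actual formulas, which the page does not show: it assumes that loading from cache divides prefill cost by the "Prefill vs CacheBlend Speed Ratio", that one "reuse" means re-serving every cached token once, and that the break-even percentage is the share of hours in which such a reuse must occur for GPU savings to cover the hourly storage cost. All function and parameter names are hypothetical.

```python
# Sketch only: every formula here is an assumption about what the labels mean.
import math


def gpu_hours_to_prefill(tokens_cached: float, tokens_per_sec_per_gpu: float) -> float:
    """GPU hours needed to prefill all cached tokens from scratch."""
    return tokens_cached / tokens_per_sec_per_gpu / 3600


def gpu_costs(tokens_cached: float, tokens_per_sec_per_gpu: float,
              gpu_price_per_hour: float, speed_ratio: float) -> tuple[float, float]:
    """Return (GPU cost to prefill, GPU cost with cache) for one full reuse."""
    prefill = gpu_hours_to_prefill(tokens_cached, tokens_per_sec_per_gpu) * gpu_price_per_hour
    with_cache = prefill / speed_ratio  # assumed: cache path is speed_ratio times faster
    return prefill, with_cache


def break_even_percent(storage_cost_per_hour: float, prefill_cost: float,
                       cached_cost: float) -> float:
    """Percent of hours needing one full reuse so that savings equal storage cost."""
    savings_per_reuse = prefill_cost - cached_cost
    if savings_per_reuse <= 0:
        return math.inf  # caching never pays off if it is not cheaper than prefill
    return 100 * storage_cost_per_hour / savings_per_reuse


if __name__ == "__main__":
    prefill, cached = gpu_costs(tokens_cached=3_600_000, tokens_per_sec_per_gpu=1000,
                                gpu_price_per_hour=2.0, speed_ratio=4.0)
    pct = break_even_percent(storage_cost_per_hour=0.15, prefill_cost=prefill,
                             cached_cost=cached)
    print(f"GPU Cost to Prefill: ${prefill:.2f}")
    print(f"GPU Cost with Cache: ${cached:.2f}")
    print(f"% of Reuses/Hours to Break Even: {pct:.2f}%")
```

A result above 100% would mean the cache has to be reused more than once per hour on average to pay for its storage.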