Fill the Data
per hour:
or over 3 years:
per hour
or over 3 years
per hour
or over 3 years
Break-Even Analysis
% of Reuses/Hours to Break Even:
0.00%
% Cache Hits
Cost Savings ($/3y)
Model
Parameters
KV Cache Size
—
Tokens/sec/GPU
—
Prefill vs CacheBlend Speed Ratio
—
Storage
Calculations
Additional Storage Cost/Hour
—
Additional Storage Cost/3y
—
Processing
Calculations
Number of Tokens cached
—
GPU Hours to Prefill
—
GPU Cost to Prefill
—
GPU Cost with Cache
—