01
Data + supervision
Church livestreams create a repeatable training environment.
The wedge is not just market access. It is measurement quality. Recurring rooms, volunteer teams, and stable signal chains create a rare real-world setting where transfer can be evaluated across repeated captures instead of isolated demos.
Synthetic pairs. Same-source amateurization gives controlled curriculum data before live capture volume is large.
Live captures. Repeated stereo workflows from the same environment make longitudinal evaluation possible.
Reference supervision. Targets define production lift while keeping preservation constraints explicit.
02
Model core
The model has to change production without replacing the performance.
Kiln targets the underexplored middle regime between tiny direct-waveform systems and huge generative music models: a causal transfer stack large enough to reshape production, but constrained enough to stay attached to the original stereo performance.
Latent audio prior. Pretrained stereo latents provide production-scale music context.
Hierarchical sequence model. Long-context modeling handles reverb, balance, dynamics, and sectional movement.
Preservation losses. Timing, identity, and phase coherence keep the output anchored to the source.
03
Evaluation path
The output matters only if the same band still sounds like itself.
The goal is not a cleaner imitation of professional music in the abstract. The goal is evidence that production can be lifted while the singer, timing, phrasing, and musical identity of the original capture remain intact.
Preservation. Same singer, same timing, same feel.
Production lift. Better translation in space, balance, and dynamics.
Deployment path. Offline post-production first, reduced-width real-time inference later.