A request enters at the edge, runs through an engine that executes code or model weights, then reads or writes data. Here is the platform that makes that happen.
Node runs the assistant, Lucee → Tomcat → JVM runs the web apps, and Ollama runs the models — while nginx is the shared front door to all three.
From bare metal up to the deploy pipeline — the vocabulary anchored to where it actually lives.
An engine is software that takes code or model weights and actually executes them. The platform runs three, each sitting behind the same nginx front door.
The inference server. Hosts the models — Ollama is the engine; gemma and the OCR models are the weights.
The assistant / agent. Talks to a model backend and orchestrates around the answers.
The CFML tier. Lucee → Tomcat → JVM: bytecode served as servlets through the Jakarta EE web container.
The shared front door across all of them — and PostgreSQL is the system of record.
Talk to our engineering team about how the AkimTechCloud stack fits your workloads.
Contact Sales →