Author: Ole Germann

  • Local AI Workflows: Owning the Process, Not the Model

    Most companies are thinking about AI from the wrong end.

    They start with the largest hosted model they can access.
    Then they try to build workflows around it.

    That works for experiments.

    It is weaker as an operating model.

    If every useful AI workflow depends on a hosted frontier model, the workflow is not fully yours.


    The local AI argument

    Local hosted AI does not mean every company should train or run a frontier model in its basement.

    That is unrealistic for most businesses.

    The hardware is expensive.
    The power requirements are real.
    The operational burden is easy to underestimate.

    But local AI does not need to replace the largest hosted models.

    It should replace the assumption that every workflow needs one.

    The future-proof part is not owning the best model.
    It is owning the workflow around the model.

    Back to the business example

    Imagine the same mid-sized seafood processing company from the previous posts.

    It has operational systems, an integration platform, an ETL/data platform and increasing interest in AI.

    The company wants AI to help with:

    • Summarizing integration errors
    • Explaining failed message flows
    • Drafting operational documentation
    • Suggesting schema mappings
    • Classifying incidents
    • Helping support staff understand system dependencies
    • Finding relevant logs, contracts and runbooks

    None of these require a frontier model for every request.

    They require controlled access to the company’s own context.

    %%{init: {'theme': 'dark'}}%%
    graph TD
        USER[Developer / Operator] --> UI[Internal AI Workflow]
        UI --> CONTEXT[Company Context]
        CONTEXT --> DOCS[Runbooks / Documentation]
        CONTEXT --> LOGS[Logs / Incidents]
        CONTEXT --> CONTRACTS[API Contracts]
        CONTEXT --> META[Integration Metadata]
    
        UI --> LOCAL[Local AI Model]
        LOCAL --> ANSWER[Draft Answer / Summary / Suggestion]

    This is where local hosted AI becomes useful.

    Not because it is always smarter.
    Because it is close to the data, predictable to operate and not priced per thought.

    Hosted models are powerful, but not neutral

    Hosted large models are excellent for many tasks.

    They are usually better at complex reasoning, broad language understanding, coding help and general problem solving.

    But they introduce dependencies:

    • Subscription pricing
    • Token-based API cost
    • Rate limits
    • Vendor policy changes
    • Data residency concerns
    • Latency and availability dependency
    • Workflow lock-in around one provider’s behavior

    For occasional use, that is acceptable.

    For core internal workflows, it becomes a strategic dependency.

    A hosted model can be a tool.
    It should not automatically become the foundation of every process.

    The subscription cost problem

    Subscription costs feel small when one person is experimenting.

    They feel different when AI is embedded into daily work.

    Once a workflow becomes useful, people use it more.

    • Developers ask more questions
    • Support teams summarize more incidents
    • Operations teams analyze more logs
    • Business users generate more reports
    • Automations call models without a human thinking about cost

    The cost model changes from “a few subscriptions” to “a metered dependency inside business operations.”

    That does not make hosted AI bad.

    It means hosted AI must be treated like any other external platform dependency: useful, but not free of architectural consequences.

    The hardware problem is real too

    Local AI also has a cost problem.

    Running large models locally requires hardware, power, cooling, storage, monitoring and people who know how to operate it.

    For many businesses, trying to run the largest models locally is not realistic today.

    That is why the practical local AI strategy is not:

    Run everything locally, no matter what.

    It is:

    Run the workflows locally where the task is narrow, repetitive, private or operationally important.

    The goal is not to win benchmark charts.

    The goal is to make useful internal AI workflows that survive pricing changes, vendor changes and model churn.

    The better architecture

    A future-proof AI architecture should not hard-code the business around one hosted model.

    It should separate the workflow from the model.

    %%{init: {'theme': 'dark'}}%%
    graph TD
        USER[User] --> WORKFLOW[AI Workflow Layer]
        WORKFLOW --> CONTEXT[Context Retrieval]
        CONTEXT --> DATA[Documents / Logs / Metadata / Contracts]
    
        WORKFLOW --> ROUTER[Model Router]
        ROUTER --> LOCAL[Local Model]
        ROUTER --> HOSTED[Hosted Large Model]
    
        LOCAL --> RESULT[Response]
        HOSTED --> RESULT

    The workflow layer owns the business process.

    The model is only one replaceable part.

    This makes it possible to use:

    • Local models for routine internal tasks
    • Hosted large models for difficult reasoning
    • Different models as quality, price and hardware improve
    • Fallbacks when one provider is unavailable
    • Company-specific prompts, retrieval and guardrails independent of vendor

    The workflow should outlive the model.

    Where local AI makes sense first

    Local hosted AI is strongest where the task is narrow and context-heavy.

    Examples:

    • Summarizing internal documentation
    • Searching runbooks and integration metadata
    • Drafting first versions of incident reports
    • Classifying support tickets
    • Explaining known error patterns
    • Generating test case drafts from contracts
    • Suggesting mappings between known schemas
    • Creating first-pass documentation for existing flows

    These tasks do not need a model that can solve every problem on the internet.

    They need a model that can work safely with local context.

    What local AI gives you

    Local AI gives the company different advantages than hosted AI.

    • More predictable cost for repeated internal usage
    • Lower dependency on external subscription changes
    • Better control over sensitive operational context
    • Lower latency for some internal workflows
    • Offline or restricted-network possibilities
    • Freedom to change models without redesigning the process
    • Better fit for internal tools where “good enough and always available” beats “best possible but metered”

    This is not glamorous.

    But enterprise systems are not built on glamour.

    They are built on repeatability, ownership and predictable failure modes.

    What local AI does not give you

    Local AI also has clear drawbacks.

    • Lower model quality compared to frontier hosted models
    • Hardware cost
    • Power and cooling requirements
    • Maintenance burden
    • Model evaluation responsibility
    • Security patching and dependency management
    • Need for internal competence

    Ignoring these drawbacks is unserious.

    But using them as an argument against local AI entirely is also wrong.

    The same was true for servers, databases, integrations, CI/CD and observability.

    Infrastructure always has a cost.

    The question is whether the company wants to own the capability or rent every execution of it.

    The hybrid model

    The best approach is usually hybrid.

    Use local AI for repeatable internal workflows.

    Use hosted large models for tasks where their quality clearly matters.

    %%{init: {'theme': 'dark'}}%%
    graph LR
        TASK[AI Task] --> DECIDE{Task Type}
    
        DECIDE -->|Routine / private / repeated| LOCAL[Local Model]
        DECIDE -->|Complex / rare / high reasoning| HOSTED[Hosted Large Model]
    
        LOCAL --> OUT[Workflow Result]
        HOSTED --> OUT

    This gives the company a practical balance.

    • Routine work stays predictable
    • Sensitive context can stay local
    • Expensive hosted calls are reserved for where they matter
    • The organization avoids building everything around one vendor
    • The workflow can improve as local models improve

    Why this is future-proof

    Local AI workflows are future-proof because model quality is changing faster than business processes.

    The specific model used today will probably not be the model used in two years.

    But the workflow may remain valuable:

    • Find the relevant integration
    • Retrieve its contracts and logs
    • Summarize recent failures
    • Suggest likely causes
    • Draft a support note
    • Create documentation from resolved incidents

    If that workflow is designed around your own systems, retrieval, metadata and process, the model can be replaced later.

    If the workflow is designed around one hosted chat interface, the business process is weaker.

    The durable asset is not the model.
    The durable asset is the company-specific workflow.

    AI should sit on top of owned architecture

    This connects directly to integration architecture.

    If the company has no control over integrations, metadata, logs, documentation or operational ownership, AI will not fix that.

    It will only summarize the confusion more confidently.

    %%{init: {'theme': 'dark'}}%%
    graph TD
        INT[Integration Platform] --> META[Metadata]
        INT --> LOGS[Logs]
        INT --> DOCS[Documentation]
        INT --> EVENTS[Operational Events]
    
        META --> AI[Local AI Workflow]
        LOGS --> AI
        DOCS --> AI
        EVENTS --> AI
    
        AI --> DEV[Developer Support]
        AI --> OPS[Operations Support]
        AI --> QA[QA / Testing]
        AI --> KNOW[Knowledge Base]

    Local AI becomes much more useful when it is connected to owned architecture.

    The integration platform provides reliable operational context.

    The AI workflow turns that context into assistance.

    The real conclusion

    Hosted large models are useful.

    They should be used.

    But relying on them for every internal workflow is not automatically future-proof.

    It creates dependency on subscription cost, API pricing, rate limits, provider availability and vendor direction.

    Local hosted AI has drawbacks, but it gives companies something important:

    A path toward owning the workflow instead of renting every execution of it.

    The realistic future is not fully local or fully hosted.

    It is local-first for routine, private and operational work.
    Hosted when the quality difference justifies the cost.

    That is the sustainable architecture.

  • Integration platform vs ETL/AI cloud platform

    An integration platform and a cloud ETL platform are not the same thing.

    They both move data.
    That is why they are often confused.

    But they solve different problems.

    The integration platform runs the business.
    The ETL and AI platform studies the business.


    The business example

    Return to the seafood processing company.

    It needs operational integrations:

    • Orders from ERP to warehouse
    • Batch data from production to quality control
    • Stock updates from warehouse to ERP
    • Shipment booking to transport partners
    • Invoice basis from ERP to finance

    These flows affect daily operations. If they stop, the business is disrupted.

    The same company also needs analytical data:

    • Production yield trends
    • Stock forecasting
    • Customer demand analysis
    • Transport cost analysis
    • Quality deviation patterns
    • AI models that predict demand, waste or capacity problems

    These flows help the business understand itself and plan better.

    Both are important.
    But they should not be the same platform.

    Two different responsibilities

    %%{init: {'theme': 'dark'}}%%
    graph TD
        OPS[Operational Systems] --> INT[Integration Platform]
        INT --> OPS2[Operational Systems]
    
        OPS --> ETL[Cloud ETL Platform]
        ETL --> LAKE[Data Lake / Warehouse]
        LAKE --> AI[AI / Analytics]

    The integration platform handles operational movement.

    • Fast enough for business processes
    • Reliable enough for production
    • Traceable enough for support
    • Controlled enough for change
    • Clear enough for ownership

    The ETL and AI platform handles analytical consumption.

    • Historical storage
    • Aggregation
    • Data modelling
    • Reporting
    • Forecasting
    • Machine learning and AI workloads

    One is about running the business.
    The other is about learning from the business.

    Why separation matters

    If the ETL platform becomes the integration platform, analytical concerns start leaking into operational flows.

    That creates problems.

    • Batch timing starts affecting business processes
    • Reporting transformations become operational dependencies
    • AI pipelines consume data before operational meaning is clear
    • Failure handling is optimized for data loads, not business recovery
    • Ownership becomes unclear between operations, analytics and development

    This is dangerous because ETL pipelines often tolerate things operational systems cannot tolerate.

    • Late data
    • Duplicate data
    • Partial loads
    • Reprocessing
    • Schema drift
    • Eventually correct results

    Those may be acceptable in analytics.

    They are not always acceptable when booking transport, reserving stock, approving quality documentation or sending invoice basis.

    Operational integration needs correctness at the point of business action.
    Analytics can often correct itself later.

    The wrong architecture

    A common mistake is to route everything through the cloud data platform because it already has connectors, storage and transformation tools.

    %%{init: {'theme': 'dark'}}%%
    graph LR
        ERP[ERP] --> ETL[Cloud ETL Platform]
        WH[Warehouse] --> ETL
        PROD[Production] --> ETL
        QA[Quality System] --> ETL
    
        ETL --> ERP
        ETL --> WH
        ETL --> TMS[Transport Portal]
        ETL --> FIN[Finance]
        ETL --> AI[AI / Analytics]

    This looks efficient on paper.

    But the platform is now responsible for two very different jobs:

    • Running operational business processes
    • Feeding analytical and AI workloads

    Those jobs have different priorities.

    ConcernIntegration platformETL / AI platform
    Main purposeOperational system communicationAnalytics and insight
    Failure handlingImmediate support, retry, alertingReload, reprocess, reconcile
    Time sensitivityBusiness-process dependentOften batch or near-real-time
    Data shapeContract-driven messagesAnalytical models and history
    OwnershipIntegration / application operationsData / analytics team
    CorrectnessCorrect when usedCan often become correct later

    The better architecture

    The integration platform should own operational flows.

    The ETL and AI platform should consume from stable, governed sources.

    %%{init: {'theme': 'dark'}}%%
    graph TD
        ERP[ERP] --> INT[Integration Platform]
        WH[Warehouse] --> INT
        PROD[Production] --> INT
        QA[Quality System] --> INT
        TMS[Transport Portal] --> INT
        FIN[Finance] --> INT
    
        INT --> ERP
        INT --> WH
        INT --> PROD
        INT --> QA
        INT --> TMS
        INT --> FIN
    
        INT --> EVENTS[Operational Events / Canonical Data]
        EVENTS --> ETL[Cloud ETL Platform]
        ETL --> LAKE[Data Lake / Warehouse]
        LAKE --> AI[AI / Analytics]

    This separation gives each platform a clean responsibility.

    • The integration platform keeps operations running
    • The ETL platform prepares data for analysis
    • The AI platform consumes governed and explainable data

    The AI platform should not become the owner of operational truth.

    It should consume from systems and integration events that are already understood, validated and traceable.

    Why AI makes this more important

    AI does not remove the need for integration architecture.

    It increases it.

    An AI model can only be useful if the data it consumes is meaningful.

    • What does this stock number represent?
    • When was it valid?
    • Was it corrected later?
    • Was this batch approved or only produced?
    • Was the order cancelled, delayed or partially fulfilled?
    • Which system owns the truth?

    These are integration and data lineage questions before they are AI questions.

    If the operational data flow is unclear, AI will automate the confusion.

    AI should consume from controlled flows

    In the seafood company, an AI model may predict demand, production capacity or waste.

    But it needs data that has operational meaning:

    • Confirmed orders, not abandoned drafts
    • Approved quality data, not temporary production notes
    • Actual stock, not stale warehouse extracts
    • Real shipment status, not optimistic planning data
    • Corrected historical data, not raw failure states

    The integration platform helps define and expose these flows clearly.

    The ETL platform can then store, model and prepare the data for reporting and AI.

    The rule of separation

    A practical rule:

    If the flow is needed to run the business, it belongs in the integration platform.
    If the flow is needed to understand the business, it belongs in the ETL and AI platform.

    Some data will move between both worlds.

    That is fine.

    But the direction of responsibility should be clear:

    %%{init: {'theme': 'dark'}}%%
    graph LR
        OPS[Operational Systems] --> INT[Integration Platform]
        INT --> BUSINESS[Business Processes]
        INT --> GOVERNED[Governed Operational Data]
        GOVERNED --> ETL[ETL / Data Platform]
        ETL --> AI[AI / Analytics]

    The integration platform should feed the analytical platform.
    The analytical platform should not quietly become the operational backbone.

    The real lesson

    Cloud ETL platforms are useful.

    AI platforms are useful.

    But they do not replace integration architecture.

    Before a company can safely use AI on operational data, it needs to know:

    • Where the data comes from
    • Which system owns it
    • How it moves
    • How failures are detected
    • How corrections are handled
    • What the data means in business terms

    AI does not make integration less important.
    It makes poor integration more expensive.

    The integration platform keeps business data reliable in motion.

    The ETL and AI platform turns reliable data into insight.

    Both are needed.
    They should not be the same thing.

  • Introducing the integration platform

    In the previous post, we looked at how point-to-point integrations decay as systems grow.

    The problem was not that systems needed to exchange data.
    The problem was that no one owned the exchange.

    This is where an integration platform becomes important.

    An integration platform is not just a technical runtime.
    It is the control layer for how systems communicate.


    Back to the business example

    Our example company is a mid-sized seafood processing business.

    It has production, warehouse, transport, finance, quality control and reporting systems. Each system has a valid reason to exist. The problem is that business processes cross all of them.

    A customer order may touch:

    • ERP for customer, item and order data
    • Production for batch and packing information
    • Warehouse for stock and lot handling
    • Quality system for certificates and approvals
    • Transport portal for shipment booking
    • Data platform for reporting and forecasting

    Without an integration platform, each system learns too much about the others.

    %%{init: {'theme': 'dark'}}%%
    graph LR
        ERP[ERP] --> WH[Warehouse]
        ERP --> PROD[Production]
        ERP --> FIN[Finance]
        WH --> PROD
        WH --> TMS[Transport Portal]
        PROD --> QA[Quality System]
        PROD --> DATA[Data Platform]
        QA --> DATA
        WH --> DATA

    Every system becomes both a business application and an integration engine.

    That is not a clean separation of responsibility.

    What the integration platform changes

    An integration platform introduces a controlled middle layer.

    Systems no longer need to know every other system directly.
    They communicate through a layer that owns transport, transformation, routing, retries, monitoring and traceability.

    %%{init: {'theme': 'dark'}}%%
    graph LR
        ERP[ERP] --> INT[Integration Platform]
        WH[Warehouse] --> INT
        PROD[Production] --> INT
        QA[Quality System] --> INT
        TMS[Transport Portal] --> INT
        FIN[Finance] --> INT
        DATA[Data Platform] --> INT
    
        INT --> ERP
        INT --> WH
        INT --> PROD
        INT --> QA
        INT --> TMS
        INT --> FIN
        INT --> DATA

    This does not remove complexity.
    It moves complexity into a place where it can be owned.

    The purpose of an integration platform is not to make integration disappear.
    It is to make integration visible, governable and supportable.

    What an integration platform owns

    A useful integration platform owns the concerns that should not be scattered across every application.

    • Message routing
    • Protocol handling
    • Data transformation
    • Contract validation
    • Retries and error handling
    • Monitoring and alerting
    • Traceability and logging
    • Operational documentation
    • Security boundaries
    • Versioning and change control

    These are not minor details.
    They decide whether a business process can be trusted in production.

    The integration platform as a business layer

    In the seafood company, an order is not just a row in an ERP system.

    It becomes a business event.

    • An order was created
    • A batch was produced
    • Stock was reserved
    • A quality certificate was approved
    • A shipment was booked
    • An invoice was prepared

    The integration platform should understand these flows at a technical level.

    Not because it replaces the business systems, but because it connects them in a controlled way.

    %%{init: {'theme': 'dark'}}%%
    graph TD
        ORDER[Order Created] --> INT[Integration Platform]
        INT --> STOCK[Reserve Stock]
        INT --> PROD[Notify Production]
        INT --> QA[Check Quality Requirement]
        INT --> TRANSPORT[Prepare Transport]
        INT --> FINANCE[Prepare Invoice Data]
        INT --> TRACE[Trace and Monitor Flow]

    Now the full process can be observed from one place.

    Why this matters operationally

    When something fails, the question should not be:

    Which system do we log into first?

    The question should be:

    Where did this business flow stop?

    An integration platform gives operations and developers a central place to answer that question.

    • Was the message received?
    • Was the message valid?
    • Was it transformed correctly?
    • Was the target system available?
    • Was it retried?
    • Was anyone alerted?
    • Can the message be replayed safely?

    Without this, teams investigate by guessing.

    With it, they investigate by following the flow.

    Integration platform does not mean one product

    An integration platform does not have to mean one specific vendor or one monolithic tool.

    It can be built from several technologies:

    • API gateway
    • Message broker
    • Integration runtime
    • File transfer handling
    • Transformation services
    • Monitoring and logging
    • Metadata and documentation
    • DevOps pipelines

    The important part is not the brand.
    The important part is that integration is treated as a platform responsibility, not as scattered side effects inside each application.

    What good looks like

    A good integration platform gives the organization:

    • One place to see important flows
    • Clear ownership of integration behavior
    • Reusable patterns for APIs, files, messages and events
    • Consistent logging and alerting
    • Safer change management
    • Better separation between business systems and transport logic
    • More reliable data for reporting and AI

    The integration platform is where data movement becomes an owned capability.

    The real goal

    The goal is not to make every integration complicated.

    The goal is to make important integrations boring.

    Boring means:

    • Visible
    • Logged
    • Monitored
    • Documented
    • Repeatable
    • Supportable
    • Safe to change

    That is what a business needs from its integration platform.


    In the next post, we separate the integration platform from another important layer: the cloud ETL and AI data platform.