In the age of AI, large corporations—not just startups—can move fast
At verticallm.ai, we’ve seen firsthand that creating an environment where small, scrappy teams can innovate without being slowed down by endless reviews is key to unlocking AI’s potential. Too often, even agile teams in large organizations get stuck in molasses. Privacy, marketing, legal, and financial reviews all pile on—each valid on its own, but collectively a major brake on speed. When an engineer needs approval from five vice presidents to ship a prototype, how can you expect innovation?
And yet, the paradox: we now have the tools to move faster than ever.
AI-assisted coding. Rapid prototyping. LLMs and vertical agents that can be tested in days. But while the tech moves fast, corporate systems haven’t kept up.
So what’s the answer?
A sandbox.
A controlled environment where innovation can happen quickly—without putting your company, your brand, or your customers at risk. Testing Agents, RAG, Tools in a controlled environment.
Why Sandboxes Work
Think of a sandbox like a lab. You don’t build the final product there—you explore, you test, you learn. You make mistakes early, where it’s safe. A sandbox is not about building a polished enterprise-grade solution. It’s about answering questions like: can we use RAG to surface the right knowledge to the right user, in real time, can agents improve the workflow of our supply chain analysts, can this tool be customized to fit our specific vertical—without months of development?
In our experience at verticallm.ai, a good sandbox gives you: (1) Speed, build and test ideas within days or weeks, not quarters; (2) Safety, keep experiments isolated from production systems and sensitive data; (3) Structure, let teams innovate with just enough guardrails—not red tape.
What You Need to Get Started
Here’s a concrete blueprint we’ve used—refined with our partners inside large companies
Set a Clear Scope (Don’t Build the World)
Start with a narrow business question or use case. Examples:
- Can we summarize customer complaints using internal data and a RAG model?
- Can we use an agent to automate 20% of our weekly KPI reporting?
- Can AI surface lead time risks from our supply chain dashboards?
Small questions lead to fast, clear results.
Define the Boundaries of Your Sandbox
You need technical and organizational constraints:
- Data: Use anonymized, non-sensitive, or synthetic data.
- Access: Restrict access to a small team of builders and testers.
- Environment: Run tests in a secure, isolated cloud instance.
- Governance: Define what needs approval—and what doesn’t. Make it explicit.
Tip: A sandbox doesn’t need to connect to your enterprise systems to be valuable. Simulate where needed.
Choose Your Stack Wisely
Your sandbox is not your product stack. Use tools that are:
- Lightweight (no 6-month implementations)
- Composable (easy to swap components)
- Interpretable (you need to explain what the model is doing)
For RAG and agent experimentation, we recommend:
- LLM access via APIs (e.g., OpenAI, Anthropic, or open models via HuggingFace)
- Vector databases like Pinecone, Weaviate, or Chroma
- LangChain or LlamaIndex for orchestration
- Streamlit or Gradio for building quick front-ends
We also offer our own agent framework at verticallm.ai, optimized for supply chain verticals—reach out if you’d like early access.
Put a Team in Place—and Shield Them
A sandbox only works if the team can actually move. That means:
- One product owner: Someone who knows the business pain point.
- One AI engineer or builder: Doesn’t need to be an LLM expert—just someone comfortable with APIs and prototyping.
- No committee reviews. Weekly check-ins with a single exec sponsor is enough.
This is not a transformation team. It’s a strike team.
Common mistakes
A common pitfall is trying to solve everything at once. Everybody should focus on one sharp use case and try to get results fast. Another showstopper is usually getting blocked by data access or low quality data. A fix is to use public, dummy, or anonymized data to start. Remember, it is not about the actual value yet. The value is in getting started and experience. Another pitfall is overbuilding. really not to focus to much on CI/CD, security audits, and enterprise architecture (for now). Finally we saw that organizing the cases in small teams, not too many stakeholders by keeping the core team small and update others weekly.
Want to know more?
Become a member or development partner, and gain exclusive access to the Verticallm AI Guide—a comprehensive document that outlines how to leverage our AI platform effectively within your organization. The guide details our methodology, structured surveys, integration of numerical data, and best practices for extracting actionable insights from your supply chain. As a member, you’ll also receive access to dedicated webinars and resources that will support you in implementing and refining your AI strategy.
Read more about our methodology
Ground Rules for a Corporate AI Sandbox
Start Narrow—Prove Value Quickly
Pick one clear, focused use case (e.g., summarize customer emails, automate KPI checks).
No boiling the ocean. If your scope takes more than 3 weeks, it’s too big.
Use Dummy or Anonymized Data
Only use non-sensitive, public, or sanitized data.
If you must use internal data, strip out PII and secure access.
Rule of thumb: If legal needs to review it, it doesn’t belong in your first test.
Build Ugly, Fast
This is not a production system. It’s a learning lab.
Use Streamlit, Jupyter, or even CLI demos. No need for slick UI.
Focus on answering: “Does this work?” Not: “Is this beautiful?”
Demo Every Week
Show progress, no matter how small.
Share failures and dead ends—that’s what the sandbox is for.
Short demo sessions (15–30 min) with one sponsor or stakeholder max.
No Permission Loops
Agree upfront: what can the team do without asking?
For example: “You don’t need approval to deploy inside the sandbox” or “You can test on dummy data without data governance review.”
Keep approval surfaces small and explicit.
Document Learnings as You Go
Use a shared doc, wiki, or Slack channel.
What worked, what didn’t, what you’d do differently.
This becomes your internal playbook for future sandboxes.
Keep the Team Tight
Max 3–4 core contributors.
Everyone should be able to ship something—not just discuss.
If more than 6 people are on a call, you’re probably out of sandbox mode.
Everything Lives Inside the Sandbox
No integration with production systems.
No direct access to customer-facing tools.
Think of it like a lab-in-a-box. Experiments don’t leak.
Join our partner program or get Early Access
Join our partner program or get Early Access
How to build a powerful and safe
In-Company AI
We are helping our clients setting, training and deploying a company specific, on-premises, version of our vertical.
Register here and our team will contact you.
In-company RAG
Start building you In-company RAG system and learn how to train your model. Learn the basic setup and common pitfalls when it comes to in-company AI.
Create a safe environment | the sandbox
We’ve seen firsthand that creating an environment where small, scrappy teams can innovate without being slowed down by endless reviews is key to unlocking AI’s potential.
How to deploy In-Company AI
Building a Company-Specific Vertical AI: A Practical Guide for C-Level and Supply Chain Leaders. Why Companies Are Exploring Vertical AI.
What we Learned building our Vertical
Going from an abstract vision of “AI that works for supply chain” to a functional, compliant, production-ready AI stack is harder than it sounds
When numbers start talking
Our founder Bas Groothedde meets our In-Company AI Agent AVA. Great stuff.
Giving your AI context in the supply chain
Context-prompting, an important tool to give an llm context. There is strong theoretical and empirical evidence that providing AI with better context improves its performance.
The Evolution of AI Agents
The landscape of AI agents has advanced from basic chatbots to sophisticated entities managing complex tasks across industries.
This is World interview featuring Professor Yann LeCun
LeCun brings clarity to the current discourse on AI with a number of thoughtful, remarks and putting it into perpective. A must-watch for anyone navigating the future of artificial intelligence.
AI is moving so fast, which is why your strategy should be long-term.
Danny Hillis, a pioneer in AI He understood something decades ago that many executives are only now coming to terms with. Technology does not evolve in stable, predictable increments.
The future of AI is not just language models. It’s causal reasoning.
Judea Pearl, is another true pioneering in artificial intelligence. But he is a little bit different. And not on the bandwagon if LLMs. Teach machines to understand the question why.
Intelligence is not what we think it is
In our series about AI, we came across the work of Professor Herbert “Herb” A. Simon, one of the founding figures in artificial intelligence.
Why the human factor remains essential in the Digital Era
No matter how advanced technology becomes, human insight remains essential in managing real-world complexity.












