THE BRIEFING ROOM

How to run a CMS proof of concept without wasting three months

The last CMS proof of concept I watched go sideways had everything going for it on paper. Three platforms shortlisted. A twelve-week evaluation window. A twenty-scenario test plan that the technology director had spent two weeks putting together. Proper budget, proper governance, a steering committee that actually showed up to meetings.

Twelve weeks later, the team had tested all twenty scenarios across all three platforms and could confidently tell the board... almost nothing. Every scenario had been touched. None had been tested with any depth. The editorial team had spent about forty minutes in each platform - barely enough to find the login screen, let alone form a view on whether they could actually publish a quarterly insights report in it. The integration with their CRM had been "tested" in a sandbox with twelve clean records, which told them roughly as much about real-world performance as test-driving a car in an empty car park tells you about the M25 at 8am.

The steering committee asked for a recommendation. The technology team gave them a spreadsheet with scores out of five across twenty criteria, none of which anyone felt strongly about, because none of the evidence was deep enough to feel strongly about. The decision got deferred. Another quarter lost.

I see this pattern more often than I'd like. And the root cause is almost always the same: the PoC tried to evaluate everything, and ended up producing reliable evidence about nothing.

The PoC is not a comprehensive evaluation

Here's where most technology leaders get this wrong, and I say this with genuine sympathy because the instinct is completely rational.

We're making a major platform decision. We should test as many scenarios as possible to make sure we're thorough.

I get it. The stakes are high. A CMS decision is a five-to-ten year commitment for most firms. The fear of picking the wrong platform is real, and the idea that more testing equals more certainty feels intuitive. But it's wrong. More testing at shallow depth equals more noise and less certainty. You end up with a spreadsheet full of amber scores that nobody can interpret.

A good PoC is a targeted hypothesis test. It answers the specific questions about this platform that you cannot resolve through demos, vendor documentation, reference conversations, or desk research. That's a much smaller set of questions than most people think.

I've written separately about the broader CMS evaluation process - the decision matrix that helps you work through criteria systematically before you ever get to a PoC. The PoC is stage three of that process. It exists to test the things that survived every other filter and still can't be answered without putting your hands on the keyboard.

What belongs in the PoC (and what doesn't)

A good PoC scenario has three characteristics. Miss any one of them and you're burning evaluation time.

First: it tests something genuinely uncertain. If everyone already agrees the platform handles this well - because you've seen it in demos, read the documentation, and spoken to three reference clients who confirmed it - testing it in the PoC is theatre. Reassuring, perhaps. But not a good use of a limited evaluation window.

Second: it tests something genuinely consequential. Not every requirement carries equal weight. If the platform fails on this scenario, is that a deal-breaker? Or a manageable limitation you'd work around? Deal-breakers belong in the PoC. Nice-to-haves don't. Be honest with yourself about which is which.

Third: it tests something that can actually be evaluated in the available time. I once watched a team include a "full content migration simulation" as a PoC scenario. The migration required three weeks of data preparation before testing could even begin. In a two-week PoC window. They got nowhere, obviously, and then marked the platform down for it. That's not a platform failure - it's a scoping failure.

For most B2B service firms evaluating a CMS, the high-value scenarios tend to cluster around three areas. The editorial workflow for your most complex content type - and I mean tested by a real editor following the actual editorial process, not a developer clicking through screens to prove the capability exists. The integration with your primary CRM or client management system, tested against a realistic data set with actual inconsistencies and edge cases, not a clean sandbox with twelve perfect records. And platform performance under realistic content volume and traffic conditions - your actual page count, your actual traffic patterns, not the vendor's optimised demo environment.

Three scenarios. Maybe four. That's usually enough.

The vendor-led PoC trap

Right, this is the bit where I suspect a few people will recognise themselves.

The vendor offers to "help design the PoC." They suggest scenarios. They provide the environment. They walk your team through the test. They're in the room while your editors try the platform for the first time, helpfully pointing out features and shortcuts.

This is not a proof of concept. This is a guided tour.

I'm not saying vendors are being dishonest. They're doing what any sensible commercial organisation would do - showcasing their strengths. The scenarios they suggest will be scenarios they know the platform handles beautifully. The environment will be clean, optimised, pre-configured. The data will be perfect. And the helpful person in the room will - without meaning to - change your editors' behaviour. People perform differently when they're being watched by someone who built the thing they're testing.

I ran a PoC a while back where we'd specifically asked the vendor not to attend the editorial testing sessions. They agreed. Then on the day, their solutions engineer turned up anyway - "just to observe." Within about twenty minutes, one of the editors had asked him a question, he'd answered it, and the whole session had quietly become a demo. We had to restart it the following week with the vendor off-site. The results were noticeably different. The editors found two things in the second session that they'd completely missed in the first, because nobody was there to steer them away from the awkward bits.

The most valuable PoC evidence comes from situations where the vendor is not in the room. Where your editorial team is working in the platform environment without assistance, under conditions that approximate how they'd actually work with it day-to-day. Where they get stuck on something and have to figure it out, because that's exactly what will happen in production.

The vendor absolutely has a legitimate role - provisioning the environment, providing initial configuration support, being available to answer technical questions. That's fair and reasonable. What they shouldn't do is define the scenarios, set the success criteria, or sit in on the user testing sessions. Keep those boundaries clean.

This is also, incidentally, where the quality of your implementation partner becomes visible. A good partner will help you design a PoC that genuinely stress-tests the platform against your requirements. A less good partner will design one that makes the platform - and by extension, their recommendation - look as strong as possible.

Scoping the PoC: four decisions that matter

Before you talk to any vendor, before you provision any environment, make four decisions.

Decision one: define your three or four scenarios. Base these on whatever evaluation framework you've been using. The PoC tests the criteria that survived every other evaluation stage and still can't be resolved without direct experience. Write them down. Be specific. "Test the editorial workflow" is not a scenario. "An editor with no CMS training creates, routes for approval, and publishes a multi-author insight article with embedded data visualisation and related content links, following the firm's actual editorial sign-off process" - that's a scenario.

Decision two: define success criteria before the PoC begins. For each scenario, articulate what a pass looks like, what a marginal result looks like, and what a fail looks like. Specific enough that two people running the same scenario independently would reach the same conclusion. If your success criteria are vague, you'll interpret the results through the lens of whichever platform you were already inclined to prefer. I've seen it happen. The post-PoC meeting becomes a Rorschach test where everyone sees what they wanted to see.

Decision three: limit the PoC to two to three weeks per platform. This is counterintuitive. Surely longer is better? In my experience, no. Longer PoCs produce more data but rarely produce better decisions. The additional time gets spent on scenarios that don't add decisional value, or on re-testing things that already produced a clear result. Two to three weeks per platform, focused on three or four deep scenarios, is enough. If it's not enough, the problem is usually that you've included scenarios that shouldn't be there.

Decision four: involve real end-users in end-user scenarios. The editorial team tests the content creation workflow. Not the development team assessing whether the workflow is technically sound. These are different questions with different answers. A platform can be technically excellent and editorially miserable. Your developers won't spot that. Your editors will - in about fifteen minutes.

Reading the results honestly

When the PoC is done, most teams do one of two things. They either declare a winner based on whichever platform felt best overall, or they get stuck in a committee argument because the results are genuinely mixed and nobody defined success criteria clearly enough to resolve it. Both are avoidable.

The thing I've noticed is that PoC results tend to get misread in a particular direction: teams are reluctant to call a fail a fail. A platform falls short on something that was explicitly identified as a deal-breaker, and suddenly there's a conversation about whether it was really a deal-breaker, or whether the scenario was set up fairly, or whether the vendor could fix it by go-live. Sometimes they can. Usually they can't. But the sunk cost of a twelve-week evaluation makes it very hard to walk away.

I watched this happen with a financial services client a few years ago. One platform had performed well across most scenarios but had a genuinely problematic compliance workflow - editors could bypass the approval process with about three clicks, which in a regulated environment wasn't a "we'll work around it" situation, it was a "we cannot ship this" situation. The vendor said they'd have a fix in the next release. The client wanted to believe them. We had a fairly direct conversation about what "next release" typically means in practice, and they chose a different platform. Six months later, the fix still hadn't shipped.

So, when you're reading the results, be disciplined about three things.

A clear pass means the platform met or exceeded success criteria on all scenarios. Proceed to reference checks and commercial evaluation. A successful PoC reduces specific identified risks - it doesn't mean implementation will be smooth. Unexpected problems will still surface. But you've got evidence-based confidence where it counts.

A qualified pass - the platform met criteria on some scenarios but not all - is where most PoCs land, and where most evaluation processes get sloppy. The question isn't "did it mostly pass?" The question is: what specifically did it fail on, and does that failure matter enough to eliminate it? Make that judgment explicitly and write it down. Don't let it slide into a vague "it was pretty good overall."

A clear fail on a scenario you identified as critical means it's out. I know that's hard when you've invested weeks in the evaluation and the vendor has been lovely and the licence pricing is attractive. But this is exactly what the PoC was designed to discover. Finding it now is a lot cheaper than finding it six months into implementation.

Document everything. Scenario descriptions, success criteria, observed results, and the judgment you made on any qualified outcomes. When someone asks in eighteen months why you chose Platform A over Platform B, you want to be able to point to evidence rather than trying to reconstruct a decision from memory.

Making it manageable

I realise that everything I've described sounds like a lot of work. And compared to the "let the vendor run a demo and call it a PoC" approach, it is. But compared to the twelve-week, twenty-scenario evaluation that produces an inconclusive spreadsheet? It's dramatically less work. And it actually produces a decision.

Two to three weeks per platform. Three or four scenarios tested in genuine depth. Real users in the room. Success criteria defined before you start.

That's the whole thing.

If you're about to run a PoC - or you've just been through one that didn't give you what you needed - the hardest part usually isn't running the evaluation. It's choosing the right three scenarios in the first place. If you'd find it useful to work through that with someone who's done this for comparable firms, a PoC planning session is a good place to start. Sometimes an outside perspective earns its keep in the first hour.