How to Choose an AI Vendor as a Mid-Market Bank
The vendor demo was excellent. The reference customers were all top-20 banks. The implementation timeline was eight months. Eighteen months later, nothing was in production — and the engagement had quietly become one of the most common patterns I see in stalled AI programs at financial institutions.
Mid-market banks are the most targeted and least equipped buyers in the enterprise AI market right now. Every major vendor has a financial services vertical. Every pitch deck leads with logos of large institutions. Every sales cycle involves a demo environment that performs beautifully on curated data.
The evaluation criteria that actually predict a successful implementation at a $5B community bank are different from the criteria that matter at JPMorgan. And the most common mistake mid-market buyers make is running an enterprise evaluation designed for institutions ten times their size, against vendors who have optimized their sales process for exactly that evaluation — regardless of whether they can actually deliver at mid-market scale.
The five criteria that actually matter
Reference customers at your scale, not their biggest logos. Every vendor will offer you JPMorgan, Bank of America, or a top-10 insurer as a reference. Those references are not useful to you. What you need is a reference customer with a similar asset size, a similar technology infrastructure, a similar regulatory profile, and a similar internal team. Ask specifically: "Can you give me three reference customers with under $20 billion in assets who have gone live in the last eighteen months?" If the answer is vague or the references they produce are all enterprise-scale institutions, that is information.
The reason this matters is that implementation complexity scales non-linearly with institutional complexity. A vendor who has delivered successfully at JPMorgan has navigated a technology organization with hundreds of engineers, a data infrastructure budget larger than your entire IT spend, and an MRM team that has established protocols for every model class. Your implementation will look nothing like that. The challenges will be different. The solutions will need to be different. A vendor without mid-market references may have no idea how to handle them.
Who does the implementation — the vendor or a partner you haven't met yet. This is one of the most important questions in vendor selection and one of the least asked. Many enterprise AI vendors sell directly but implement through a network of system integrators and consulting partners. The partner handles the actual implementation work, and the vendor manages the relationship. The problem is that the partner is often not in the room during the sales cycle, the partner's capabilities vary widely across the network, and the partner assigned to your engagement may have limited experience with your specific vendor's product.
Ask directly: "Who will do the implementation work, and can I meet that team before we sign?" If the vendor pivots to a partner introduction only after contract execution, you are buying an unknown quantity. If they can introduce you to the specific people who would lead your implementation during the evaluation, that is a meaningful signal about how they operate.
Whether the vendor has navigated MRM approval at a regulated institution. Getting an AI system through Model Risk Management at a bank is a specific process with specific requirements. A vendor who has done it before knows what documentation MRM will ask for, what ongoing monitoring MRM will require, and how to structure the system to make the validation defensible. A vendor who has not done it — or who has only done it at large institutions with dedicated MRM liaison teams — may be genuinely unprepared for what your MRM team will ask.
Ask: "Can you walk me through how you have supported MRM validation at a similar institution?" A vendor who can answer this with specific examples, specific artifacts, and specific timelines has done the work. A vendor who pivots to general statements about model governance has not.
What the data egress and exit provisions look like. This is a contract question, but it belongs in the evaluation conversation because it surfaces assumptions the vendor may have about how long you will stay with them. At minimum, you need to know: what data is stored in the vendor's environment, how you access it, what happens to it if you terminate, and what format it is available in for migration. Vendors who make it difficult to answer these questions clearly are vendors who have thought about lock-in.
This matters more for AI vendors than for most software categories because AI systems often involve fine-tuning on your data, training on your historical records, or building embeddings and indices that represent your institutional knowledge. If that material lives in the vendor's environment with no clean export path, terminating the relationship means starting over — which is the most effective form of lock-in there is.
What the "when this goes wrong" support model looks like. Every vendor will describe their support in terms of SLAs, uptime guarantees, and escalation paths during the sales process. What you actually need to understand is what happens when something goes wrong in a way that doesn't fit the standard support model. When the model produces an output that a regulator questions. When the integration breaks during a system upgrade. When the vendor changes the underlying model and your MRM-approved validation is suddenly out of date.
Ask for a specific example of how they have handled a production issue at a regulated client. The quality of that answer will tell you more about their operational maturity than any SLA document. If you find yourself in a situation where the vendor relationship has already gone wrong, there is a separate playbook for that.
The criteria that sound important but aren't
The standard enterprise software RFP was not designed for AI vendor selection, and a surprising number of the criteria it produces don't predict implementation success.
Number of patents and research publications. Research output correlates with technical sophistication, not with the ability to deliver a production implementation at a mid-market bank on time and on budget. Some of the most technically impressive AI vendors have the worst implementation track records. Some of the best implementation vendors have modest research profiles. Evaluate the implementation, not the research.
Year founded and funding raised. Neither predicts product quality or delivery capability at your scale. A well-funded vendor in year two of existence may have excellent engineers and no implementation experience. A bootstrapped vendor in year eight may have a narrow but deeply proven product. Treat these as context, not as evaluation criteria.
Analyst rankings and awards. Gartner, Forrester, and similar analyst rankings are calibrated to enterprise buyers. Their evaluation methodology weights factors that matter at scale — breadth of features, size of customer base, roadmap ambition — and weights less heavily the factors that matter most to a mid-market bank: implementation quality at your scale, MRM experience, support responsiveness for smaller clients. Use analyst reports for market orientation, not for vendor selection.
How to run a proof-of-concept that tests the right things
A vendor POC should answer one question above all others: can this vendor deliver, at your institution, in your environment, on your data? A POC that runs in the vendor's demo environment on a dataset they prepared does not answer that question.
A useful POC runs on your production data, or as close to it as legal and security allow. It involves your integration team, not just the vendor. It encounters the actual complexity of your environment — your legacy systems, your data quality issues, your network constraints, your change management process. It also involves a specific person from the vendor who would lead your implementation, not just the solutions engineer who manages the sales cycle.
The POC should also test the governance path, at least at a preliminary level. Does the vendor have documentation that can support an MRM submission? Have they thought about how ongoing monitoring would work in your environment? A vendor who treats these questions as post-sale concerns is a vendor who has not done this at a regulated institution before.
The contract provisions mid-market buyers consistently fail to negotiate
Once you have selected a vendor, the contract negotiation is where the implementation either gets set up for success or set up for the kind of drift that produces stalled programs. Three provisions deserve more attention than they typically get.
Scope definition with change order thresholds. "Implementation" is not a scope. What is in scope needs to be defined in enough detail that both parties would agree on whether a given piece of work is included. Anything above a defined dollar threshold that is outside scope should require a formal change order rather than informal scope expansion. This is standard in technology contracts and frequently absent in AI vendor agreements.
Named delivery resources. The team that sold you is rarely the team that implements. Negotiate the right to approve key named resources on your engagement — specifically the implementation lead — and the right to request replacement if those resources change without your agreement. Vendors will push back on this. Push back harder.
Model change notification. If the vendor updates the underlying AI model your system depends on, you need to know before it happens. An undisclosed model update can invalidate an MRM-approved validation, change the behavior of a system you have deployed in production, and create regulatory exposure you didn't know you had. Require written notification at least thirty days before any material model change, with enough technical detail for your MRM team to assess the impact.
The vendor is evaluating you too. Mid-market banks that get the best implementations are the ones who came in knowing what they wanted, asked the right questions before signing, and made clear from the first conversation that they were a serious buyer who would hold the vendor accountable. That reputation travels.
Evaluating AI vendors right now or trying to get an existing vendor relationship back on track? I'd be glad to compare notes.
Email me