Wise Data Science
Human-in-the-Loop Threshold Management System for Machine Learning Models
Machine Learning (ML) models are at the core of modern businesses, powering everything from fraud detection to personalized recommendations. However, effectively deploying these models requires ensuring their decisions align with real-world business goals, regulatory requirements, and user experience. This often comes down to optimizing a single, critical parameter: the model threshold.
A threshold is a cutoff point used to convert continuous probability scores (e.g., a number between 0 and 1) from classification models into discrete class labels (e.g., "fraud" or "not fraud"). For example, if a model predicts a transaction has a 0.8 probability of being fraudulent, a threshold of 0.7 would classify it as "fraud," while a threshold of 0.9 would classify it as "not fraud."
Decision Flow based on Threshold. This flowchart visually represents the application of a threshold to a model's score. If the score is greater than or equal to the threshold, the item is flagged for investigation; otherwise, it is allowed to proceed.
Traditionally, optimizing these thresholds has been a complex, time-consuming task, often relegated to highly technical experts. At Wise, we faced the challenge of managing diverse customer segments, each with unique risk profiles and compliance mandates and as a solution we developed Threshold UI. This groundbreaking Human-in-the-Loop (HITL) system empowers non-technical stakeholders to directly influence and optimize ML model performance, marking a paradigm shift towards Human-Guided Machine Learning.
The Bottleneck: Optimizing ML Thresholds
While seemingly simple, selecting the right threshold is profoundly impactful and deceptively difficult, especially in high-stakes real-world applications like financial services.
1) Diverse Customer Cohorts (Subrules): Our customer base isn't monolithic. We serve individuals, businesses, and different geographic regions, each with unique risk profiles and regulatory obligations. A "one-size-fits-all" threshold often leads to sub-optimal outcomes. This necessitates different thresholds for different customer segments, which we refer to as "subrules."
This diagram illustrates how incoming transfers are processed by a fraud engine and then distributed into various customer cohorts (A, B, C, D, E). Each customer cohort is shown with a distinct numerical value (e.g., 0.7, 0.55), representing the specific threshold applied to that cohort. This highlights the necessity of having diverse, segment-specific thresholds to cater to the unique risk profiles and requirements of different customer groups.
2) Interdependent and Complex Constraints: It's not enough to optimize for each subrule independently. We face a web of interconnected constraints:
Global Constraints: For instance, maximum fraud rates required to operate in a specific region or a global limit on the number of transactions that can be held for scrutiny.
Local (Cohort-Specific) Constraints: Regulatory requirements might dictate that for customers in a specific country, the false positive rate for a particular model must be below a certain percentage, or a minimum recall rate (percentage of actual fraud caught) must be maintained for high-risk cohorts.
Business Objectives: Beyond regulation, different business objectives might apply. For one cohort, minimizing false negatives (risk of potential fraud) might be paramount, while for another, minimizing false positives (risk of disrupting legitimate transactions) could be the priority due to customer experience considerations.
3) Combinatorial Explosion: Imagine having 20 customer cohorts for a given model, and for each customer cohort hundreds of potential threshold values to select from. Manually exploring all combinations to find the set of thresholds that satisfies all global and local constraints, while optimizing for an overall business objective, is a monumental task. This quickly becomes mathematically intractable; with n = 400 thresholds across k = 20 cohorts, the search spaceconsists of nk combinations, approximately 40020 – a number roughly equivalent to the estimated number of atoms on Earth. Manually evaluating this many options is just too difficult.
4) Impact on Business and Compliance: The implications of sub-optimal thresh-old management are significant. Slow model refreshes mean models might not adapt quickly to changing fraud patterns or market conditions. Non-compliance with regulations can lead to severe penalties. Most importantly, it can negatively impact customer trust and experience if legitimate transactions are unnecessarily delayedor blocked, or if genuine risks are missed.
This "last mile" problem of ML deployment is a critical bottleneck, hindering the agility and effectiveness of our machine learning systems.
Introducing Threshold UI: Your Co-Pilot for ML Decisions
Threshold UI is a sophisticated Human-in-the-Loop (HITL) system designed to put the power of ML decision-making directly into the hands of those who understand the business context best. It combines invaluable human domain knowledge, business priorities, and regulatory constraints with advanced optimization algorithms to find the best possible solution.
1. Empowering Non-Technical Users
The platform is engineered for usability, abstracting away underlying mathematical complexity. Non-technical users can interact with Threshold UI through:
Direct Input of Business Constraints: Users define constraints using clear, business-centric language (e.g., "Can we find a threshold combination that keeps average daily suspension count below X?").
Clear Objective Function Selection: Users select the primary business objective they want to optimize for each model or subrule (e.g., minimizing overall cost, maximizing fraud detected or balancing precision and recall).
Scalability: With a growing portfolio of models and an increasing number of
customer cohorts, managing these constraints manually would be impossible.
Threshold UI automates this process, ensuring that scalability isn’t compromised
by complexity.
The Threshold UI configuration screen allows non-technical stakeholders to define model constraints and objectives. Users can set global limits (e.g., "Daily backlog limit") and subrule-specific constraints (e.g., minimum precision or recall). They can also choose the objective function (e.g., F2-score) for each subrule, translating complex business requirements into actionable optimization parameters. The "Threshold alternatives" field indicates the granularity of threshold options the underlying algorithm considers.
2. Offline Experimentation and One-Click Deployment
Threshold UI facilitates rapid, risk-free experimentation and deployment.
Generate Multiple "What-If" Scenarios: Users define several alternative sets of constraints and objective functions. Threshold UI then instantly calculates the optimal thresholds for each scenario, providing a comparative view of how different strategic choices would impact model performance.
Side-by-Side Comparison: The UI presents these different optimization options in a clear, comparative format, allowing product managers to understand the trade-offs and benefits of each.
One-Click Deployment: Once an optimal set of thresholds is identified, deploying it is as simple as a single click. This drastically reduces the time from decision to production, cutting model refresh times from hours or days to minutes.
This detailed view within Threshold UI provides a granular breakdown of metrics for each subrule across different threshold options (Baseline vs. New/Optimized). Users can examine how proposed threshold changes affect key performance indicators for specific customer segments, enabling informed decisions and fine-tuning for diverse cohorts. This helps in understanding the precise impact of optimized thresholds on various aspects of the business.
3. Ensuring Responsible AI: Threshold UI and Model Governance
Threshold UI plays a pivotal role in strengthening our model governance framework, particularly in model deployment and ongoing performance management. This includes:
Risk Management: Identifying and mitigating potential risks associated with model decisions, including bias, fairness, and unintended consequences.
Regulatory Compliance: Ensuring models adhere to internal policies, industry standards, and external regulations (e.g., GDPR, financial regulations).
Enhanced Transparency and Auditability: Every optimization run, along with the specified constraints, objectives, and resulting optimal thresholds, is recorded. This creates a clear, auditable trail of decision-making.
Scalable Governance: Threshold UI provides a scalable solution for operationalizing governance policies, allowing for consistent application of rules across a vast portfolio of models and customer segments.
4. Beyond the Numbers: The Human-Guided Machine Learning Paradigm
Threshold UI embodies a profound shift towards Human-Guided Machine Learning, recognizing that human intelligence, intuition, and domain expertise remain indispensable. It creates a synergistic relationship where humans and AI collaborate, each leveraging their unique strengths.
The design philosophy behind Threshold UI is deeply rooted in principles of Human-Computer Interaction (HCI):
Intuitive Design: Complex optimization problems are abstracted into clear, visual, and interactive elements, reducing cognitive load and broadening accessibility.
Feedback Loops: Users receive immediate, understandable feedback on the impact of their choices, reinforcing learning and building confidence.
Transparency and Explainability: While the underlying algorithm (Output Thresholding Using Mixed Integer Linear Programming - OTLP, refer OTLP: Output Thresholding Using Mixed Integer Linear Programming ) is sophisticated, Threshold UI presents its results in an easily digestible format. Users can see how constraints are met, what trade-offs were made, and the specific impact on different cohorts.
Human in the Loop: Threshold UI ensures that the human is always in control, providing the strategic direction and ethical guardrails, while the system handles the computational heavy lifting. This avoids the pitfalls of fully automated systems that might optimize for technical metrics at the expense of business reality.
Jobs you might be interested in
Salary
12,50,000 INR + RSUs
Location
Hyderabad
Team
Operations
Locations
Hyderabad
Description
We are looking for Payment Operations Senior Specialists to join our Payment Operations Regional team, with a focus on Service Delivery.Your mission: Wise has already pioneered new ways for people to
Reference
a794a68e-e860-47d7-8a04-75ecb98282f9
Expiry Date
01/01/0001
Salary
9250 - 12400 SGD Monthly + RSUs
Location
Singapore
Team
Operations
Locations
Singapore
Description
The Business Operations Associate Manager will support critical discovery, analysis, and improvement initiatives in servicing operations. This role focuses on leveraging servicing operations insights
Reference
911c266e-2f46-47ce-b455-b79c84d96a87
Expiry Date
01/01/0001
Salary
770,000 HUF Monthly
Location
Budapest
Team
Operations
Locations
Budapest
Description
We’re looking for FinCrime Investigators to join our growing AML Investigations team in Budapest. This role is a unique opportunity to have an impact on Wise’s mission, grow as a specialist in anti-mo
Reference
931eb331-544e-44b9-9b29-479c3474b817
Expiry Date
01/01/0001
Salary
650000 - 825000 INR Annual
Location
Hyderabad
Team
Operations
Locations
Hyderabad
Description
The Servicing Training Specialist role facilitates key learning and development initiatives that support business objectives and fosters employee growth. They are responsible for conducting onboarding
Reference
3285aa42-ff60-4cb2-9c59-36000321d801
Expiry Date
01/01/0001
Salary
4350 - 5850 EUR gross/monthly + RSUs
Location
Tallinn
Team
Operations
Locations
Tallinn
Description
We're looking for an experienced lead to join our growing AML Investigations team in Tallinn, Estonia. As a FinCrime Operations Senior Lead your mission is to ensure the effective assessment and mitig
Reference
dd288711-4f15-4d73-a3de-1d957816231e
Expiry Date
01/01/0001
Salary
1,490,000 - 2,000,000 HUF gross/monthly + RSUs
Location
Budapest
Team
Operations
Locations
Budapest
Description
We're looking for an experienced lead to join our growing AML Investigations team in Budapest, Hungary. As a FinCrime Operations Senior Lead your mission is to ensure the effective assessment and miti
Reference
74b74e87-2792-45e8-ad19-4f1839c1d4b9
Expiry Date
01/01/0001
Salary
6900 - 7450 BRL Monthly
Location
São Paulo
Team
Operations
Locations
São Paulo
Description
We’re looking for a Japanese speaker Sanctions Analyst, to join our growing Sanctions Screening team in Sao Paulo, Brazil. This role is a unique opportunity to have an impact on Wise’s mission, grow a
Reference
27347f87-4e28-4572-8c25-002140485f57
Expiry Date
01/01/0001
Salary
.
Location
São Paulo
Team
Operations
Locations
São Paulo
Description
The Regional Lead of AML Investigations is a leadership role responsible for the strategic direction and operational excellence of Anti-Money Laundering (AML) investigations. Beyond functional oversig
Reference
fa4e5c05-29af-48b0-8790-40f95a0c3ca8
Expiry Date
01/01/0001
Salary
4100 - 5100 EUR Monthly
Location
Tallinn
Team
Operations
Locations
Tallinn
Description
Our Servicing Operations team is on a critical path to scale by accelerating our adoption of Artificial Intelligence. We are building a hybrid workforce where every team member is empowered by an AI '
Reference
7c99a9c6-9dfa-461f-94f9-c5a49ef305ac
Expiry Date
01/01/0001
Related blogs
Teaser
People profileContent Type
BlogPublish date
03/02/2026
Summary
"When you join, you’ll see your own growth mirrored by the growth of the company. You won't just be watching change happen; you’ll be a direct part of it." Ataro Shoji (He/Him) Payment
by
Verona Hasani
Teaser
People profileContent Type
BlogPublish date
06/27/2025
Summary
We sat down with SK Saraogi, Head of Expansion APAC, to discuss our strategic expansion into Hyderabad, India 🇮🇳Read more to discover why Hyderabad is the perfect location for our second glo
by
Verona Hasani
Teaser
People profileContent Type
BlogPublish date
06/06/2025
Summary
Driven by a passion for growth and team development, discover how Anna Pavlics advanced from Agent to Team Lead in Wise's Fraud Prevention team 🚀 "We put tremendous effort into bui
by
Verona Hasani
Teaser
People profileContent Type
BlogPublish date
02/18/2025
Summary
Head of Servicing Scale and Experience, Ian Rynne, discusses his journey from starting as a Customer Support agent to becoming the Head of Servicing Scale and Experience at Wise.
by
Verona Hasani
Teaser
Our cultureContent Type
BlogPublish date
05/15/2024
Summary
My name is Cynthia. I'm a fraud agent at Wise, based in the Austin office. We strive to prevent fraudulent activity on the Wise platform. It can be tricky, as Wise supports many diff
Content Type
BlogPublish date
01/24/2024
Summary
Wise’s volunteer day is not just a perk; it’s a celebration of community, camaraderie, and making a difference.Meet Claire Adelman, Customer Support Training Specialist, and Javier Perdo
Teaser
People profileContent Type
BlogPublish date
11/24/2022
Summary
Hi! My name is Delis and I’m a Due Diligence Agent (CDD) in our Tallinn team, focusing on business verification. What this means is that I onboard high risk businesses by assessing their r
Teaser
People profileContent Type
BlogPublish date
11/24/2022
Summary
Hi! My name is Rza Mustafayev and I’m a Due Diligence Agent in our Latin America & Middle-East and Africa Region, focusing on personal and business customers. What this means is that I:Rev