Case Study

65% AI-Powered Customer Service Automation for a Global Affiliate Network

Softwise.AI partnered with MyLead, a global affiliate marketing network, to automate the approval and rejection of publisher campaign applications - a high-volume, sensitivity-laden process previously handled entirely by customer-service agents. Using a tailored hybrid approach that combines a low-cost statistical classifier with GPT-4 for the harder, language-rich cases, MyLead now automates 65% of its customer-service workload (~16,200 requests per month), has freed six full-time agents to focus on complex cases, and saw a 1200% ROI in the first year.

CLIENT

MyLead

YEAR

2024

INDUSTIRES

Affiliate Marketing - Customer Service & Operations

DELIVERABLES

Hybrid Classification System (Statistical + GPT-4), Discovery & Historical Data Analysis, Production Deployment, Post-Launch Quality and Reliability Optimization

The problem

MyLead operates a global affiliate marketing network connecting thousands of campaigns with publishers around the world. Every day, the customer-service team has to decide whether a given publisher should be accepted into a given campaign - matching the campaign's specifications against the publisher's marketing strategy, considering language, audience, and brand-safety constraints. It is high-volume work where the wrong call carries real risk - placing a children's product campaign on an adult site, for example, is the kind of mistake that damages both the advertiser relationship and the brand.

MyLead logo

The pressure was structural: large daily volumes, sensitivity that demanded careful judgment, seasonal spikes, lost revenue every minute a publisher sat waiting in the queue, and a workforce that took weeks to onboard. Pure-human handling capped how fast the network could grow.

Softwise.AI was asked to figure out, quickly and concretely, how much of this work could responsibly be automated - and then to build the solution.


{Key Challenges}

Why human-only approvals couldn't keep up with a global affiliate network?

Discovery surfaced an interlocking set of operational pressures shaping the brief:


  • High daily volume

    MyLead works with thousands of campaigns daily and must ensure each campaign's specification matches the marketing strategy described by the publisher.


  • Sensitivity of campaign-to-site matching

    Customers and publishers come from very different segments worldwide - so it's critical not to place certain campaigns (e.g. children's products) on incompatible sites (e.g. adult content).


  • Seasonality

    Volume varies sharply by time of day and time of year, making consistent staffing inefficient.


  • Opportunity cost while publishers wait

    Every minute a publisher waits to be accepted into a campaign is a minute they cannot earn - and a minute MyLead does not capture revenue.


  • Human errors and downstream complaints

    Manual decisions inevitably produce mistakes. Complaints mean less satisfied publishers, longer wait times, and more work resolving the very issues that automation could prevent.


  • Slow, expensive agent onboarding

    New agents need weeks of training to be effective - driving up costs and creating a constant recruitment burden.


{Process}

How we built a hybrid statistical + GPT-4 automation in 4 stages?

Softwise.AI delivered the engagement in four clear stages, prioritising fast feedback to MyLead so the company could make informed decisions about scope and investment at every step.

1. Discovery - data analysis on a million historical inquiries

The Discovery focused on what MyLead already had: roughly 1,000,000 historical inquiries with accept/reject labels. The team worked with a deliberately bounded sample of 5,000 records - large enough to surface real correlations, small enough to analyze cleanly, and current enough to avoid issues with parameters not tracked in older data. Workshops with MyLead were used to add new data points as correlations emerged. Two feature groups dominated the analysis: application data (user scoring, description) and campaign data (provider sensitivity, restrictions).

2. Statistical solution - the cheap, accurate first pass

Before reaching for an LLM, the team tested whether a simpler statistical approach could carry part of the workload. Two clean correlations emerged: user scoring below 12 and campaigns categorized as "sensitive" together produced reliable auto-rejection on 23.4% of all applications with ~94% accuracy - comparable to human performance, at essentially zero LLM cost.

3. GPT solution - language-aware classification for the harder cases

For the remaining cases, the team evaluated multiple LLMs. GPT-3.5 would have been enough for English-only work, but MyLead's ~40 languages (across Europe and Asia) required GPT-4's stronger multilingual accuracy - despite being roughly 10x more expensive than GPT-3.5. A carefully engineered prompt instructed the model on MyLead's decision criteria and required structured outputs. The GPT-4 layer added auto-rejection on another ~23.5% of applications at ~75% accuracy - a level MyLead accepted given that historical data itself contains human error and genuinely borderline cases.

4. Proof-of-Concept, MVP, and post-launch iteration

After Discovery, the team validated the model on a held-out test set in a pre-production environment, then took the MVP into production. Within ~3 months, results exceeded original estimates: 65% combined automation - higher than the 43% projected from Discovery. Iteration after launch tightened the system further:

  • Quality - prompt changes made the model less lenient, defaulting to rejection under uncertainty.

  • Explainability - a diagnostic mode was added so every decision came with an explanation.

  • Reliability - Temperature and Top_p parameters were tuned to give consistent classifications for identical inputs.

Solution

The system MyLead now runs in production is a hybrid - a cheap statistical layer handling the clear-cut cases, with GPT-4 reserved for the language-rich and ambiguous ones. The architecture is deliberately layered for cost-efficiency, accuracy, and explainability.

Statistical layer - clear-cut auto-rejection

A lightweight rules-based classifier looks at user scoring and campaign sensitivity. Where the combination is unambiguous, the application is auto-rejected without involving an LLM. This layer alone handles roughly a quarter of all applications at ~95% accuracy - effectively free of LLM inference cost.

GPT-4 layer - multilingual, criteria-aware classification

Everything the statistical layer doesn't decide on goes to GPT-4. A carefully engineered prompt encodes MyLead's decision rules and required output format. GPT-4 was chosen over GPT-3.5 specifically for its stronger handling of the ~40 languages MyLead processes - the multilingual accuracy difference justified the higher per-call cost.

Diagnostic mode for explainability

Every decision the model makes is accompanied by an explanation - turning the system into something MyLead's team can audit, debug, and continually improve, rather than a black box.

Tuned for reliability

Temperature and Top_p sampling parameters were adjusted post-launch to ensure that identical inputs produce identical classifications - a non-trivial property in production LLM systems and a requirement for any operations team that needs to defend its decisions.

Designed to grow

The architecture is built to evolve: prompt updates and statistical-rule refinements can both be made without rebuilding the system, and new languages or campaign categories can be folded in as MyLead's network expands.


{Results}

65% automation, 16,200 requests monthly, and a 1200% first-year ROI

After approximately three months in production, the system delivered results well above the initial Discovery projections:

  • 65% automation rate of all customer-service inquiries (up from a projected 43%).

  • ~16,200 requests automated per month.

  • 6 full-time agents freed from routine work and reassigned to complex tasks.

  • 1200% ROI in the first year.

  • ~2 calendar months from first meeting to production (across summer holidays, other open projects, and standard operational work).

  • Coverage across ~40 languages without separate models per region.

  • Decision-by-decision explainability through the built-in diagnostic mode.


{Business impact}

From a customer-service bottleneck to a scalable growth lever

  • Faster publisher onboarding into campaigns. Less waiting means publishers earn more, sooner - and MyLead captures more network activity per day.

  • Six agents reallocated to high-value work. The most experienced people no longer spend their days on routine accept/reject decisions; they focus on the complex cases where human judgment actually matters.

  • Lower error and complaint volume. Consistent, criteria-driven decisions reduce the noise that human fatigue and seasonality introduced.

  • Headcount-independent scaling. Growth in campaign and publisher volume no longer maps directly to agent headcount.

  • A platform that learns from itself. Diagnostic mode and tunable reliability mean MyLead can continuously improve the model without rebuilding it.


{Key recommendations}

5 lessons from hybrid AI automation at production scale

A few principles from the MyLead build that apply to any team automating high-volume operational classification:

  1. Try a statistical model before reaching for an LLM. Cheap, deterministic rules can carry a quarter of the workload at near-human accuracy. Sending those cases to GPT is wasteful and slower.

  2. Quality of the dataset beats quantity. 5,000 carefully chosen records produced more useful insight than the full million would have - older data lacks parameters that only started being tracked later.

  3. Choose model size by the use case, not the hype. GPT-3.5 would have been fine for English; GPT-4 was required for genuine multilingual accuracy. The per-call cost difference is worth it only when the use case demands it.

  4. Engineer reliability explicitly. Temperature, Top_p, and prompt structure are not implementation details - they determine whether identical inputs produce identical decisions, which is what makes the system defensible in operations.

  5. Build explainability in from the start. A diagnostic mode that justifies each decision turns the system into something the business can debug, audit, and improve - not a black box that quietly drifts.

SUMMARY

Softwise.AI delivered a hybrid statistical + GPT-4 automation system that took MyLead from a fully manual customer-service queue to 65% AI-driven decisioning - covering roughly 16,200 monthly requests, across ~40 languages, with full explainability and a 1200% first-year ROI. Built in approximately two calendar months from first meeting to production, the system is a concrete demonstration that disciplined, cost-aware AI engineering - statistical layers first, LLMs where they earn their place - delivers operational excellence at low initial investment and rapid time-to-value.

TESTIMONIAL

"

"

Our collaboration with Softwise.ai has been instrumental in reshaping MyLead's customer service landscape. With a 65% AI-powered automation rate, efficiently managing 16,200 monthly requests, we've witnessed a significant transformation and an impressive ROI within the first year.

The integration has been seamless, enhancing operational efficiency and, more importantly, customer satisfaction. The precision and speed of AI-driven responses have notably improved our service quality.

Women Side Pose

Paweł Wojcieszak

Head of Affiliate, MyLead

Contact us!

By submitting my information, I confirm that I have read and understood the Softwise.AI Privacy Policy.

Contact us!

By submitting my information, I confirm that I have read and understood the Softwise.AI Privacy Policy.

Contact us!

By submitting my information, I confirm that I have read and understood the Softwise.AI Privacy Policy.

Padewska 23/7

00-777 Warszawa

Tel.: +48 601 789 982

SOFTWISE.AI Sp. z o.o.

NIP: 9522254365

KRS: 0001097298

REGON: 528213750

Padewska 23/7

00-777 Warszawa

Tel.: +48 601 789 982

SOFTWISE.AI Sp. z o.o.

NIP: 9522254365

KRS: 0001097298

REGON: 528213750

Padewska 23/7

00-777 Warszawa

Tel.: +48 601 789 982

SOFTWISE.AI Sp. z o.o.

NIP: 9522254365

KRS: 0001097298

REGON: 528213750