×
Case Studies Blog Careers Technologies
Intelligent document processing and data extraction solution using open source and advanced ML techniques
Industry
FinTech
Technologies
Python, NumPy, Pandas
About Client

Our client, a well-recognized global Fintech company, processes large volumes of data from financial documents often placed through an agent under specified conditions of sale, which they receive from external agencies as an email attachment. There are 20+ agencies and each agency emails data in 12-15 different file formats. The client eventually receives 20,000 - 25,000 emails/documents daily.

Overview

Objectives

  • Simplify the manual process, enable analysis, and classification of such documents through digitization and extract the data they contain
  • Work across documents of different sizes, layouts, formats at scale
  • Integrate the extracted information with backend platforms on real-time basis with relevant audit logs
  • Ensure accurate transaction per minute processing performance to handle tens of thousands of documents

Solution

We built a custom solution that could extract data from documents of various sizes, layouts and formats at scale. 

  • Developed an independent microservice to read email and fetch attached documents at scale 
  • Leveraged multiple technologies including Python, Adobe, Pandas, Numpy and AWS Textract to address complexity and improve readability of documents and data 
  • Built run-time intelligence to minimize usage of high-cost components and make it cost-effective

Outcomes

The custom-built solutions helped the client meet all processing SLAs with business-ready information on a day-to-day basis resulting in significant reconciliation accuracy as well as huge cost and time savings; with negligible human intervention thereby reducing human errors and improving overall performance of the company

Testimonial
Choosing a digital partner for us was about more than capabilities — it’s about collaboration and business evolution. Whether our goal of building the world’s first AI automated ATM Security software application or Hyper Automation for Dispute Resolutions and Reconciliation or be it a transformation of our core business processes – The team Oneture is always there to partner with us to help us gain— and maintain—competitive advantage with efficient, sustainable models and tailored made solutions at scale. Over the last 2.5 years, their commitment to quality solution delivery and proactive approach in problem-solving, hands-on talent, agility is a key strength that built trust, not just with me but with my business partners too.
Rohit Kilam
CTO - CMS Info Systems