Back close

Dr. Peeta Basa Pati

Professor, School of Computing, Bengaluru

Qualification: BE, MSc, Ph.D
@amrita.edu
Research Interest: Camera Captured Document Analysis, Handwritten Document Analysis, Intelligent Document Processing, Machine Learning Applications to Document Processing

Bio

Dr. Peeta Basa Pati is a professor at Department of CSE, Amrita Vishwa Vidyapeetham, Bangalore. His research interests are in the field of document digitization and information capture. He has 20+ years of experience in this field which includes 15+ years of industrial experience. Prior to joining Amrita, Dr Pati was working at Cognizant Technology Solutions as a chief architect. In this role, he has built and managed IDP systems, implemented and successfully productionized IDP systems for multiple business domains and organizations. As part of this he has experience in dealing with documents which are structured as well as unstructured, type written & handwritten, dealing with documents with graphical information content.

Publications

Journal Article

Year : 2008

Word level multi-script identification

Cite this Research Publication : Peeta Basa Pati and Ramakrishnan, A. G., “Word level multi-script identification”, Pattern Recognition Letters, vol. 29, pp. 1218-1229, 2008.

Publisher : Pattern Recognition Letters

Year : 2005

OCR in Indian Scripts: A Survey

Cite this Research Publication : Peeta Basa Pati and Ramakrishnan, A. G., “OCR in Indian Scripts: A Survey”, IETE Technical Review, vol. 22, pp. 217-227, 2005.

Publisher : Taylor & Francis

Year : 2002

Script Identification in Printed Bilingual Documents

Cite this Research Publication : D. Dhanya and Ramakrishnan, A. G., “Script Identification in Printed Bilingual Documents”, Document Analysis Systems V, vol. 27, pp. 73-82, 2002.

Publisher : Springer Berlin Heidelberg

Conference Paper

Year : 2022

Handwriting Quality Assessment using Structural Features and Support Vector Machines

Cite this Research Publication : "Handwriting Quality Assessment using Structural Features and Support Vector Machines", P B Pati, Chandana G V, C Rithish Reddy, G Balaji Subash & Jasti SriHarsha, 2022 Int. Conf. for Convergence in Technology, Pune, India, Apr, 2022.

Publisher : Int. Conf. for Convergence in Technology

Year : 2022

Speech to Equation Conversion using a PoE Tagger

Cite this Research Publication : "Speech to Equation Conversion using a PoE Tagger", P B Pati & Shreyas V, 2022 Int. Conf. for Convergence in Technology, Pune, India, Apr, 2022.

Publisher : Int. Conf. for Convergence in Technology

Year : 2022

System for Identification of References

Cite this Research Publication : "System for Identification of References", Vaishak Sajeev & P B Pati, 2022 IEEE 2ND Asian Conference on Innovation in Technology (ASIANCON2022), Aug 2022. Pune, INDIA.

Publisher : ASIANCON2022

Year : 2022

Performance Analysis of Machine Learning Algorithms on Multi-Touch Attribution Model

Cite this Research Publication : "Performance Analysis of Machine Learning Algorithms on Multi-Touch Attribution Model", Satyabrata Pattanayak, P B Pati, Tripty Singh, IEEE 3rd International Conference of Emerging Technologies (INCET), Belgaum, India, May 2022.

Publisher : INCET

Year : 2022

Accuracy Comparison of Neural Models for Spelling Correction in Handwriting OCR Data

Cite this Research Publication : "ACCURACY COMPARISION OF NEURAL MODELS FOR SPELLING CORRECTION IN HANDWRITING OCR DATA", Shivalila H, P B Pati & Neelima N, 4th International Conference on Communication, Computing and Electronics Systems, (ICCCES - 2022), Virtual, Sep 2022

Publisher : ICCCES

Year : 2022

Prediction of Diabetes Using Machine Learning: Analysis of 70,000 Clinical Database Patient Record

Cite this Research Publication : S. M. Kuriakose, P. Basa Pati and T. Singh, "Prediction of Diabetes Using Machine Learning: Analysis of 70,000 Clinical Database Patient Record," 13th International Conference on Computing Communication and Networking Technologies (ICCCNT), Kharagpur, India, IEEE, 2022, pp. 1-5, doi: 10.1109/ICCCNT54827.2022.9984264.

Publisher : IEEE

Year : 2022

Offline HWR Accuracy Enhancement with Image Enhancement and Deep Learning Techniques

Cite this Research Publication : "Offline HWR Accuracy Enhancement with Image Enhancement and Deep Learning Techniques", Aashu Kumar & P B Pati, International Conference on Machine Learning and Data Engineering (ICMLDE2022), Sep 2022, Dehradun, INDIA.

Year : 2022

A Framework for the prediction of Diabetes Mellitus using Hyper-Parameter tuned XGBoost Classifier

Cite this Research Publication : Gayathri R, P B Pati & Tripty Singh, "A Framework for the prediction of Diabetes Mellitus using Hyper-Parameter tuned XGBoost Classifier", 13th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Virtual, Oct 2022.

Conference Proceedings

Year : 2009

Industry-Academia Collaboration via Internships

Cite this Research Publication : S. Sivananda, Sathyanarayana, V., and Peeta Basa Pati, “Industry-Academia Collaboration via Internships”, 2009 22nd Conference on Software Engineering Education and Training. 2009.

Publisher : 2009 22nd Conference on Software Engineering Education and Training

Year : 2007

A Blind Indic Script Recognizer for Multi-script Documents

Cite this Research Publication : Peeta Basa Pati and Ramakrishnan, A., “A Blind Indic Script Recognizer for Multi-script Documents”, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007). pp. 1248-1252, 2007.

Publisher : Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Year : 2007

Automatic Seal Information Reader

Cite this Research Publication : F. Nourbakhsh, Peeta Basa Pati, and Ramakrishnan, A. G., “Automatic Seal Information Reader”, 2007 International Conference on Computing: Theory and Applications (ICCTA'07). pp. 502-505, 2007.

Publisher : 2007 International Conference on Computing: Theory and Applications (ICCTA'07)

Year : 2006

Text Localization and Extraction from Complex Gray Images

Cite this Research Publication : F. Nourbakhsh, Peeta Basa Pati, and Ramakrishnan, A. G., “Text Localization and Extraction from Complex Gray Images”, Computer Vision, Graphics and Image Processing. Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 776-785, 2006.

Publisher : Springer Berlin Heidelberg

Year : 2006

HVS Inspired System for Script Identification in Indian Multi-script Documents

Cite this Research Publication : Peeta Basa Pati and Ramakrishnan, A. G., “HVS Inspired System for Script Identification in Indian Multi-script Documents”, Document Analysis Systems VII. Springer Berlin Heidelberg, Berlin, Heidelberg, 2006.

Publisher : Springer Berlin Heidelberg

Year : 2006

Document Page Layout Analysis Using Harris Corner Points

Cite this Research Publication : F. Nourbakhsh, Peeta Basa Pati, and Ramakrishnan, A. G., “Document Page Layout Analysis Using Harris Corner Points”, 2006 Fourth International Conference on Intelligent Sensing and Information Processing. pp. 149-152, 2006.

Publisher : Fourth International Conference on Intelligent Sensing and Information Processing

Year : 2006

Automatic text block separation in document images

Cite this Research Publication : K. R. Arvind, Peeta Basa Pati, and Ramakrishnan, A. G., “Automatic text block separation in document images”, 2006 Fourth International Conference on Intelligent Sensing and Information Processing. pp. 53-58, 2006.

Publisher : Fourth International Conference on Intelligent Sensing and Information Processing

Year : 2006

Horizontal Projection Profiles for extraction of Text Paragraphs from Document Images

Cite this Research Publication : Peeta Basa Pati, R, A. K., and Ramakrishnan, A. G., “Horizontal Projection Profiles for extraction of Text Paragraphs from Document Images”, IEEE Intl. Conf. on Sig. & Im. Proc. 2006.

Publisher : IEEE

Year : 2006

Can Biological Motion be a Biometric?

Cite this Research Publication : Peeta Basa Pati and Ramakrishnan, A. G., “Can Biological Motion be a Biometric?”, 2006 Fourth International Conference on Intelligent Sensing and Information Processing. pp. 2-4, 2006.

Publisher : Fourth International Conference on Intelligent Sensing and Information Processing

Year : 2006

Binarization and Localization of Text Images Captured on a Mobile Phone Camera

Cite this Research Publication : B. Antony, Peeta Basa Pati, and Ramakrishnan, A. G., “Binarization and Localization of Text Images Captured on a Mobile Phone Camera”, 2006 Fourth International Conference on Intelligent Sensing and Information Processing. pp. .224 – 229, 2006.

Publisher : Fourth International Conference on Intelligent Sensing and Information Processing

Year : 2005

Text Localization and Extraction from Complex Color Images

Cite this Research Publication : S. S. Raju, Peeta Basa Pati, and Ramakrishnan, A. G., “Text Localization and Extraction from Complex Color Images”, Advances in Visual Computing. Springer Berlin Heidelberg, Berlin, Heidelberg, pp. 486-493, 2005.

Publisher : Advances in Visual Computing, Springer Berlin Heidelberg, Berlin

Year : 2004

Gabor filters for document analysis in Indian bilingual documents

Cite this Research Publication : Peeta Basa Pati, S. Raju, S., Pati, N., and Ramakrishnan, A. G., “Gabor filters for document analysis in Indian bilingual documents”, International Conference on Intelligent Sensing and Information Processing, 2004. Proceedings of. pp. 123-126, 2004.

Publisher : International Conference on Intelligent Sensing and Information Processing

Year : 2004

Gabor filter based block energy analysis for text extraction from digital document images

Cite this Research Publication : S. S. Raju, Peeta Basa Pati, and Ramakrishnan, A. G., “Gabor filter based block energy analysis for text extraction from digital document images”, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings. pp. 233-243, 2004.

Publisher : First International Workshop on Document Image Analysis for Libraries

Year : 2000

Printed Odiya Character recognition System

Cite this Research Publication : Peeta Basa Pati and Ramakrishnan, A. G., “Printed Odiya Character recognition System”, Proc. of Conf. of Information Technology, Dec’2000. . 2000.

Publisher : Proc. of Conf. of Information Technology

Book Chapter

Year : 2010

Design of a Bilingual Kannada–English OCR

Cite this Research Publication : R. S. Umesh, Peeta Basa Pati, and Ramakrishnan, A. G., “Design of a Bilingual Kannada–English OCR”, in Book Chap in Guide to OCR in Indic Scripts, Springer-Verlag, V. Govindaraju and Setlur, S. (Ranga), Eds. London: Springer London, 2010, pp. 97–124.

Publisher : Springer-Verlag,

Patents

Year : 2019

System and method for automated processing of electronic documents

Cite this Research Publication : Peeta Basa Pati, “System and method for automated processing of electronic documents”, 2019.

Publisher : Patent granted by USPTO

Year : 2019

A System and a Method for Developing a Tool for Automated Data Capture

Cite this Research Publication : Peeta Basa Pati, “A System and a Method for Developing a Tool for Automated Data Capture”, 2019.

Year : 2014

Data extraction confidence attribute with transformations

Cite this Research Publication : Peeta Basa Pati, “Data extraction confidence attribute with transformations”, 2014.

Publisher : Patent granted by USPTO

Year : 2013

Automatic data validation and correction

Cite this Research Publication : Peeta Basa Pati, “Automatic data validation and correction”, 2013.

Publisher : Patent granted by USPTO

Education
  • 2007: Ph.D.
    Indian Institute of Science, Bangalore
  • 2001: M.Sc. (Engg.)
    Indian Institute of Science, Bangalore
  • 1998: B.E. Electrical Engineering
    NIT (REC) Rourkela
Professional Appointments
Year Affiliation
June 2021- Present Professor, Department of Computer Science and Engineering, School of Engineering, Amrita Vishwa Vidyapeetham, Bengaluru
2007 – 2021 Chief Architect, Cognizant Technology Solutions

Research & Management Experience

  • 22 years of research experience
  • 14 years of experience in project, people and product management

Major Research Interests

  • Document Image Processing, Machine Learning, Text Engineering and Natural Language Processing

Membership in Professional Bodies

  • Senior Member – IEEE
Keynote Addresses/Invited Talks/Panel Memberships
  • Tutorials
    • “Information capture from Documents at Industrial Scale”, Tutorial presented at IEEE-iSES 2021, MNIT Jaipur, INDIA, Dec’21.
    • “Image Analysis with optimal space-frequency filters,” along with Prof. A G Ramakrishnan at Intl. Conf. on Systemic, Cybernetics and Informatics – 2007, Hyderabad, Jan’07.
    • “Gabor Filters for Image Processing,” along with Prof. A G Ramakrishnan at Intl. Conf. on Info. Tech. – 2006, Bhubaneswar, Dec’06.
    • “OCR and Handwriting Analysis” along with Prof. A G Ramakrishnan at Intl. Conf. on Systemic, Cybernetics and Informatics (ICSCI 2005), Hyderabad, Jan’05.
    •  “Biometric based Person Identification System,” along with Prof. A G Ramakrishnan at Conf. on Info. Tech. 2003, Bhubaneswar, Dec’03.
  • Invited Talks
    • “Non-Linear ML Model-Decision Tree, Entropy, Information Gain, Overfitting Vs Under fitting”, July 4, 2022, organized by Department of IT, Indira Gandhi Delhi Technical University for Women, Delhi.
    • “Decision Tree – classification & regression”, Jun 26, 2022, part of lecture series in Odisha ML Summer School 2022
    • “Data Science – Introduction & applications in Education”, Apr 22, 2022, STEM@NEP2020 Series, jointly organized by Edudevs and Amrita Vishwa Vidyapeetham for school teachers and administrators across India.
    • “Artificial Intelligence – Introduction, Applications & Future Scope”, Mar 20, 2022, organized by Visual Media Technologies for college students & faculty members in & around Krishnagiri, TN, India.
    • “Machine Learning – Introduction & application in Education”, Mar 11, 2022, STEM@NEP2020 Series, jointly organized by Edudevs and Amrita Vishwa Vidyapeetham for school teachers and administrators across India.
    • “Information capture from Documents at Industrial Scale”, Tutorial presented at IEEE-iSES 2021. “Life in Industrial Set-up: An Introduction”, Sep 24, 2021, M Tech & B Tech Inauguration, Amrita School of Engineering, Bengaluru, India.
    • “Gabor Filters in Digital Image Processing: An Intuitive Exploration”, August 27, 2021, Amrita School of Arts & Sciences, Mysuru, India.
    • “Intelligent Document Processing (IDP) Systems”, July 31, 2021, Amrita School of Engineering, Bengaluru, India.
    • “Introduction to Data Mining Techniques and Applications,” delivered a lecture series at Cognizant, Apr – May 2012.
    • “Image Restoration,” PES College of Engg., Bangalore, Sep’12.
    • “Introduction to Color Image Processing,” PES College of Engg., Bangalore, Oct’11.
    • “Image Analysis: Prospective and Challenges,” PES College of Engg., Bangalore, Jul’11.
    • “Challenges before Intelligent Document Analysis – An Overview,” at Institute of Technology and Education Research, Bhubaneswar, Nov’09.
    • “Document Image Analysis,” at Silicon Inst. of Tech., Bhubaneswar, Dec’08.
    •  “Text Localization in Complex Color Images,” at HP-Labs India, Bangalore, Feb’06.
    • “Document Image Analysis,” at NMAM Inst. of Tech., Nitte, Karnataka, Feb’06.
    • “Image Enhancement Techniques,” at MSR School of Advanced Studies, Mar’03.
    • “Principal Component Analysis (PCA) for Human Face Recognition,” at Regional College of Management, Bhubaneswar, Feb’03.
    • “Linear Techniques of Face Detection and Recognition,” at Institute of Technology and Education Research, Bhubaneswar, Feb’03.
  • Technical chair in international conferences
    • Member Program committee for ISED 2010
    • Member Program Committee for Summer school on Document Image Processing – 2008, conducted at CCE, IISc. Bangalore.
    • Tutorial Chair of the International Conf. on Info. Tech – 2006 & Publicity chair for ICIT – 2007.
    • Member of program or technical review committee for ISVC – 2005, 06, 07, 08, 09 & 2010 and ICIT – 2006, 07, 08 & 09
  • Reviewed research papers for journals and conferences
    • Springer Journal on Circuits, Systems & Signal Processing
    • IEE Proc. Vision, Image & Signal Processing journal
    • ISVC, USA – 2005, 06, 07, 08, 09 & 2010
    • ICIT, India – 2006, 07, 08 & 09
Courses Taught
  • Digital Image Processing
  • Introduction to Data Mining & Machine Learning
  • Biomedical Signal and Image Processing
  • Basic Electrical Engineering
Student Guidance

Postgraduate Students

Sl. No. Name of the Student(s) Topic Status – Ongoing/Completed Year of Completion
1 Sabari Raju S. Text Extraction of Complex Color Documents Completed 2004
2 Farshad Nourbakhsh Extraction of text, seal and handwritten in document images Completed 2006

Name and Area of PhD Scholars Being Guided (Undergoing)

  • Ms Shaveta Khepra — Document digitization
  • Ms Remya Sivan — Palm leaf digitization
  • Ms Priyanka Prabhakar — Legal document summarization
Research Scholars

⦁ Ms. Shaveta Khepra
⦁ Ms. Surya S (Co-Supervision with Dr Manoj Panda, ECE Department)
⦁ Ms. Priyanka Prabhakar
⦁ Ms. Remya Sivan
⦁ Ms. Roshni M
⦁ Mr. Dileepkumar K S
⦁ Mr. Bhavith M P
⦁ Dr. Anand M
⦁ Ms. Kusuma J (Co-Supervision with Dr Nidhin Prabhakar, CSE Department)

Admissions Apply Now