OCMR: A comprehensive framework for optical chemical molecular recognition

Comput Biol Med. 2023 Sep:163:107187. doi: 10.1016/j.compbiomed.2023.107187. Epub 2023 Jun 27.

Abstract

Artificial intelligence (AI) has achieved significant progress in the field of drug discovery. AI-based tools have been used in all aspects of drug discovery, including chemical structure recognition. We propose a chemical structure recognition framework, Optical Chemical Molecular Recognition (OCMR), to improve the data extraction capability in practical scenarios compared with the rule-based and end-to-end deep learning models. The proposed OCMR framework enhances the recognition performances via the integration of local information in the topology of molecular graphs. OCMR handles complex tasks like non-canonical drawing and atomic group abbreviation and substantially improves the current state-of-the-art results on multiple public benchmark datasets and one internally curated dataset.

Keywords: Bioinformatics; Chemical informatics; Chemical structure recognition; Molecular graph; OCMR; OCSR.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence*
  • Benchmarking*
  • Drug Discovery