WellGreen-i Co., Ltd.

– Capture key information quickly and accurately through WGI’s AI text mining technologies (A2K/LA2K) –

◆ In the following descriptions, the letter following each “Task” (e.g., Task A) corresponds to the codes (A–Q) listed under “Outsourcing task(s)” in the Contact Us form.

◆High-Precision and High-Throughput Literature Review using AI
― Accelerating the understanding of complex biological phenomena involving many kinds of genes, compounds, and growth-related environmental factors by harnessing AI-driven literature mining ―
Quicker, more precise literature reviews free up the time and effort previously spent on reviewing, so that it can be redirected toward advancing the project.

◆WGI’s Proprietary High-Throughput Literature Review Service for the Life Sciences Field (Task B)
― A2K / LA2K Technology ―

Notes on Automatically Collected and Analyzed Information in This Contracted Service

◆WGI’s Proprietary A2K/LA2K-Based Service for Collecting and Summarizing Knowledge-Based Information

Enhancing the time- and cost-efficiency, as well as the precision, of collecting knowledge-based information in the life sciences field.

◆Aggregation of Knowledge-based Information in the Life Sciences Using A2K Technology

WGI has developed AI text-mining technology, known as A2K (Article-to-Knowledge) Technology, which enables automated, high-precision, large-scale literature reviews to collect key information for each project. Using this technology, WGI offers services that collect and summarize key information from big data.


The A2K Advantage: Transforming Literature Search in Life Sciences
◆Development of LA2K Technology: A2K Technology Cannot Comprehensively Collect Gene Information

In A2K analysis, it can be difficult to comprehensively aggregate gene information when using gene names as key terms.

To solve this problem, WGI has developed Life Science A2K (LA2K) technology, which automatically recognizes gene names appearing in the text of papers and provides their A2K Descriptions.

◆LA2K Technology: AI-Based Automatic Recognition of Gene Names in Text

LA2K technology incorporates an AI-driven named-entity recognition (NER) method that automatically identifies gene names in text.

◆Example of Analysis Using LA2K Technology

LA2K technology collects documents related to the key term(s) and extracts sentences containing the key term(s) and/or gene name(s). In this process, the gene names that appear in the document are automatically recognized by the LA2K technology. The relevant information from these sentences is then summarized and presented in the A2K Description format.
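The sentence-extraction step described above can be illustrated with a minimal sketch. This is not WGI's implementation: LA2K is proprietary and uses AI-based named-entity recognition, whereas this sketch substitutes a small hand-written gene lexicon (`GENE_LEXICON`) and plain string matching. The function names, the lexicon, and the sample text are all hypothetical and exist only for illustration.

```python
import re

# Hypothetical gene lexicon standing in for LA2K's AI-based gene-name recognizer.
GENE_LEXICON = {"ESR1", "GATA3", "TFF1"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' when followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def find_genes(sentence):
    """Return lexicon gene symbols that appear as whole tokens in the sentence."""
    tokens = set(re.findall(r"[A-Za-z0-9]+", sentence))
    return sorted(tokens & GENE_LEXICON)


def extract_sentences(text, key_terms):
    """Keep sentences that mention a key term and/or a recognized gene name."""
    results = []
    for sentence in split_sentences(text):
        lowered = sentence.lower()
        terms = [t for t in key_terms if t.lower() in lowered]
        genes = find_genes(sentence)
        if terms or genes:
            results.append(
                {"sentence": sentence, "key_terms": terms, "genes": genes}
            )
    return results


# Invented sample text, not taken from any paper.
sample = (
    "Breast cancer subtypes differ in ESR1 expression. "
    "The weather was mild. "
    "GATA3 was co-expressed with TFF1."
)
for hit in extract_sentences(sample, ["breast cancer"]):
    print(hit["genes"], hit["key_terms"], hit["sentence"])
```

A fixed lexicon only finds names it already knows, which is exactly the limitation the text says LA2K's automatic recognition is meant to remove. The sketch shows the filtering logic only, not the recognition method or the summarization into A2K Descriptions.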

An example of analysis using LA2K technology is presented in the following figure. LA2K technology extracts key information related not only to the key term(s) but also to gene(s). In this example, the A2K Descriptions of the automatically recognized gene names in the document are shown.

An example of LA2K analysis using the key terms: breast cancer, gene expression profiling, microarray analysis, and estrogen receptor alpha.


In this way, LA2K technology lets the computer recognize gene names in the text automatically and correctly, much as a researcher would during manual review. Owing to the powerful text analysis capability of LA2K technology, key information related to genes can be extracted from documents even when neither the computer nor the user knows the gene names in advance.

Source: Yano K, Imai K, Shimizu A, Hanashita T., Nucleic Acids Research. 2006 Mar 14;34(5):1532–9., DOI: 10.1093/nar/gkl058 / PMID: 16537840
Parts of the text have been reconstructed as case studies of syntactic analysis and are used under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.
◆Advantages and Future Prospects of LA2K
◆Acquisition of Gene Function Information: A Comparison between LA2K Technology and Conventional Method

In general, similarity analysis of DNA and amino acid sequences is widely used to predict gene function. While sequence similarity information is useful for studies such as molecular evolution and mutation analysis, it is not appropriate to rely on it alone for gene function prediction. Commonly used similarity analysis does not adequately account for the conservation of functional domains within sequences. To investigate gene functions with high reliability, literature surveys remain the most effective approach.

◆High-efficiency, high-accuracy information integration with A2K/LA2K technology
― beyond the reach of conventional manual review ―
― Liberation from the Enormous Time and Effort Required for Literature Surveys ―

Although LA2K is an AI-based analytical technology and cannot guarantee the same accuracy as manual literature reviews conducted by domain experts, it offers many advantages that make it a powerful tool for advancing life science research.

Advantages of Utilizing LA2K Technology
LA2K technology is an extremely powerful tool for a wide range of activities, such as research and development and business strategy planning.

◆Integration of A2K/LA2K Analysis Results with Omics Data to Accelerate the Identification of Genes and Compounds

Integrating LA2K analysis results with omics data maximizes the information available for elucidating the molecular mechanisms of various biological processes. Information on transcription factors and cis-elements, gene expression data, and homolog information (gene families) within and across species are all valuable omics data for this integration. Such integration of multifaceted information facilitates the achievement of research goals, including gene identification.
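As a rough illustration of what such an integration could look like, the sketch below joins literature-derived gene mentions with gene expression data by gene symbol. It is an assumption about one possible workflow, not a description of WGI's method. The function `integrate`, the `min_abs_log2fc` threshold, and all numbers are hypothetical.

```python
def integrate(literature_hits, expression, min_abs_log2fc=1.0):
    """Join literature-derived gene mentions with expression changes.

    literature_hits: dict of gene symbol -> number of supporting sentences
    expression:      dict of gene symbol -> log2 fold change
    Returns (gene, sentence_count, log2fc) tuples for genes present in both
    inputs whose |log2FC| meets the threshold, sorted by sentence count
    (descending), then |log2FC| (descending), then gene symbol.
    """
    shared = literature_hits.keys() & expression.keys()
    rows = [
        (gene, literature_hits[gene], expression[gene])
        for gene in shared
        if abs(expression[gene]) >= min_abs_log2fc
    ]
    return sorted(rows, key=lambda r: (-r[1], -abs(r[2]), r[0]))


# Invented values for illustration only.
literature_hits = {"ESR1": 12, "GATA3": 5, "TFF1": 5}
expression = {"ESR1": 2.1, "GATA3": -1.4, "TFF1": 0.3, "MKI67": 3.0}

for gene, sentences, log2fc in integrate(literature_hits, expression):
    print(gene, sentences, log2fc)
```

Here a gene is kept only when it has both literature support and a sufficiently large expression change. Other omics layers mentioned above, such as transcription factor, cis-element, or homolog information, could be joined on the same gene-symbol key in a similar way.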

◆Features of A2K/LA2K Technology