<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-17T20:21:55Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:10230/72077" metadataPrefix="oai_dc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:10230/72077</identifier><datestamp>2025-12-05T15:39:34Z</datestamp><setSpec>com_2072_6</setSpec><setSpec>col_2072_452952</setSpec></header><metadata><oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd">
   <dc:title>PretoxTM: a text mining system for extracting treatment-related findings from preclinical toxicology reports</dc:title>
   <dc:creator>Corvi, Javier</dc:creator>
   <dc:creator>Ronzano, Francesco</dc:creator>
   <dc:creator>Centeno, Emilio</dc:creator>
   <dc:creator>Capella Gutiérrez, Salvador Jesús, 1985-</dc:creator>
   <dc:subject>Adverse effect</dc:subject>
   <dc:subject>Animal model</dc:subject>
   <dc:subject>Annotated corpora</dc:subject>
   <dc:subject>Compound</dc:subject>
   <dc:subject>Drug</dc:subject>
   <dc:subject>Natural language processing</dc:subject>
   <dc:subject>Preclinical</dc:subject>
   <dc:subject>Text mining</dc:subject>
   <dc:subject>Toxicology</dc:subject>
   <dc:description>Over the last few decades the pharmaceutical industry has generated a vast corpus of knowledge on the safety and efficacy of drugs. Much of this information is contained in toxicology reports, which summarise the results of animal studies designed to analyse the effects of the tested compound, including unintended pharmacological and toxic effects, known as treatment-related findings. Despite the potential of this knowledge, the fact that most of this relevant information is only available as unstructured text with variable degrees of digitisation has hampered its systematic access, use and exploitation. Text mining technologies have the ability to automatically extract, analyse and aggregate such information, providing valuable new insights into the drug discovery and development process. In the context of the eTRANSAFE project, we present PretoxTM (Preclinical Toxicology Text Mining), the first system specifically designed to detect, extract, organise and visualise treatment-related findings from toxicology reports. The PretoxTM tool comprises three main components: PretoxTM Corpus, PretoxTM Pipeline and PretoxTM Web App. The PretoxTM Corpus is a gold standard corpus of preclinical treatment-related findings annotated by toxicology experts. This corpus was used to develop, train and validate the PretoxTM Pipeline, which extracts treatment-related findings from preclinical study reports. The extracted information is then presented for expert visualisation and validation in the PretoxTM Web App. Scientific Contribution While text mining solutions have been widely used in the clinical domain to identify adverse drug reactions from various sources, no similar systems exist for identifying adverse events in animal models during preclinical testing. PretoxTM fills this gap by efficiently extracting treatment-related findings from preclinical toxicology reports. This provides a valuable resource for toxicology research, enhancing the efficiency of safety evaluations, saving time, and leading to more effective decision-making in the drug development process.</dc:description>
   <dc:description>JC was partly supported by eTRANSAFE (received funding from the Innovative Medicines Initiative 2 Joint Undertaking under grant agreement No 777365 and support from the European Union's Horizon 2020 research and innovation programme and EFPIA).</dc:description>
   <dc:date>2025-12-01T17:31:53Z</dc:date>
   <dc:date>2025-12-01T17:31:53Z</dc:date>
   <dc:date>2025</dc:date>
   <dc:date>2025-12-01T17:31:53Z</dc:date>
   <dc:type>info:eu-repo/semantics/article</dc:type>
   <dc:type>info:eu-repo/semantics/publishedVersion</dc:type>
   <dc:identifier>Corvi J, Diaz-Roussel N, Fernandez JM, Ronzano F, Centeno E, Accuosto P, Ibrahim C, Asakura S, Bringezu F, Frohlicher M, Kreuchwig A, Nogami Y, Rih J, Rodriguez-Esteban R, Sajot N, Wichard J, Wu HYM, Drew P, Steger-Hartmann T, Valencia A, Furlong LI, Capella-Gutierrez S. PretoxTM: a text mining system for extracting treatment-related findings from preclinical toxicology reports. J Cheminform. 2025 Feb 3;17(1):15. DOI: 10.1186/s13321-024-00925-x</dc:identifier>
   <dc:identifier>1758-2946</dc:identifier>
   <dc:identifier>http://hdl.handle.net/10230/72077</dc:identifier>
   <dc:identifier>http://dx.doi.org/10.1186/s13321-024-00925-x</dc:identifier>
   <dc:identifier>http://hdl.handle.net/10230/72077</dc:identifier>
   <dc:language>eng</dc:language>
   <dc:relation>Journal of Cheminformatics. 2025;17(1):15</dc:relation>
   <dc:relation>info:eu-repo/grantAgreement/EC/H2020/777365</dc:relation>
   <dc:rights>© The Author(s) 2025. Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.</dc:rights>
   <dc:rights>http://creativecommons.org/licenses/by/4.0/</dc:rights>
   <dc:rights>info:eu-repo/semantics/openAccess</dc:rights>
   <dc:format>application/pdf</dc:format>
   <dc:format>application/pdf</dc:format>
   <dc:publisher>BioMed Central</dc:publisher>
</oai_dc:dc></metadata></record></GetRecord></OAI-PMH>