<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet type="text/xsl" href="static/style.xsl"?><OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd"><responseDate>2026-04-14T03:06:55Z</responseDate><request verb="GetRecord" identifier="oai:www.recercat.cat:2117/101910" metadataPrefix="qdc">https://recercat.cat/oai/request</request><GetRecord><record><header><identifier>oai:recercat.cat:2117/101910</identifier><datestamp>2025-07-22T17:35:57Z</datestamp><setSpec>com_2072_1033</setSpec><setSpec>col_2072_452951</setSpec></header><metadata><qdc:qualifieddc xmlns:qdc="http://dspace.org/qualifieddc/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:doc="http://www.lyncode.com/xoai" xsi:schemaLocation="http://purl.org/dc/elements/1.1/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dc.xsd http://purl.org/dc/terms/ http://dublincore.org/schemas/xmls/qdc/2006/01/06/dcterms.xsd http://dspace.org/qualifieddc/ http://www.ukoln.ac.uk/metadata/dcmi/xmlschema/qualifieddc.xsd">
   <dc:title>Data mining in Breast Cancer</dc:title>
   <dc:creator>Carrio Viladrich, Laura</dc:creator>
   <dc:subject>Àrees temàtiques de la UPC::Informàtica</dc:subject>
   <dc:subject>Machine learning</dc:subject>
   <dc:subject>Databases</dc:subject>
   <dc:subject>Aprenentatge automàtic</dc:subject>
   <dc:subject>Bases de dades</dc:subject>
   <dcterms:abstract>Machine learning and data mining methods can be the future of the clinical decision&#xd;
process like pathological diagnosis. In this project we studied Breast Cancer Wisconsin&#xd;
dataset and applied different algorithms, concretely classifiers, in order to predict the&#xd;
diagnosis and the prognostic of the cancer.&#xd;
In order to classify the different types of cancer we divided the classification in two steps&#xd;
and we tested different algorithms for each step. The first step is the diagnosis&#xd;
classification. Diagnosis consistsin predict if the cancer is malignant and benign. And the&#xd;
second step is the prognostic classification. Prognostic consist in predict if cancer is&#xd;
recurrent or non-recurrent.&#xd;
After applying different models for each steps the result is that the best model to predict&#xd;
the diagnosis is the Decision Forest model. And the best model to predict the prognostic&#xd;
is the Boosted Decision Tree model.&#xd;
So, we conclude that the two step classifier with Decision Forest model and Boosted&#xd;
Decision Tree model is the best classifier.</dcterms:abstract>
   <dcterms:issued>2016</dcterms:issued>
   <dc:type>Bachelor thesis</dc:type>
   <dc:rights>Open Access</dc:rights>
   <dc:publisher>Universitat Politècnica de Catalunya</dc:publisher>
</qdc:qualifieddc></metadata></record></GetRecord></OAI-PMH>