-
iASiS: Towards Heterogeneous Big Data Analysis for Personalized Medicine
Authors:
Anastasia Krithara,
Fotis Aisopos,
Vassiliki Rentoumi,
Anastasios Nentidis,
Konstantinos Bougatiotis,
Maria-Esther Vidal,
Ernestina Menasalvas,
Alejandro Rodriguez-Gonzalez,
Eleftherios G. Samaras,
Peter Garrard,
Maria Torrente,
Mariano Provencio Pulla,
Nikos Dimakopoulos,
Rui Mauricio,
Jordi Rambla De Argila,
Gian Gaetano Tartaglia,
George Paliouras
Abstract:
The vision of IASIS project is to turn the wave of big biomedical data heading our way into actionable knowledge for decision makers. This is achieved by integrating data from disparate sources, including genomics, electronic health records and bibliography, and applying advanced analytics methods to discover useful patterns. The goal is to turn large amounts of available data into actionable info…
▽ More
The vision of IASIS project is to turn the wave of big biomedical data heading our way into actionable knowledge for decision makers. This is achieved by integrating data from disparate sources, including genomics, electronic health records and bibliography, and applying advanced analytics methods to discover useful patterns. The goal is to turn large amounts of available data into actionable information to authorities for planning public health activities and policies. The integration and analysis of these heterogeneous sources of information will enable the best decisions to be made, allowing for diagnosis and treatment to be personalised to each individual. The project offers a common representation schema for the heterogeneous data sources. The iASiS infrastructure is able to convert clinical notes into usable data, combine them with genomic data, related bibliography, image data and more, and create a global knowledge base. This facilitates the use of intelligent methods in order to discover useful patterns across different resources. Using semantic integration of data gives the opportunity to generate information that is rich, auditable and reliable. This information can be used to provide better care, reduce errors and create more confidence in sharing data, thus providing more insights and opportunities. Data resources for two different disease categories are explored within the iASiS use cases, dementia and lung cancer.
△ Less
Submitted 9 July, 2024;
originally announced July 2024.
-
Irreversible reinsurance: Minimization of Capital Injections in Presence of a Fixed Cost
Authors:
Salvatore Federico,
Giorgio Ferrari,
Maria-Laura Torrente
Abstract:
We propose a model in which, in exchange to the payment of a fixed transaction cost, an insurance company can choose the retention level as well as the time at which subscribing a perpetual reinsurance contract. The surplus process of the insurance company evolves according to the diffusive approximation of the Cramér-Lundberg model, claims arrive at a fixed constant rate, and the distribution of…
▽ More
We propose a model in which, in exchange to the payment of a fixed transaction cost, an insurance company can choose the retention level as well as the time at which subscribing a perpetual reinsurance contract. The surplus process of the insurance company evolves according to the diffusive approximation of the Cramér-Lundberg model, claims arrive at a fixed constant rate, and the distribution of their sizes is general. Furthermore, we do not specify any specific functional form of the retention level. The aim of the company is to take actions in order to minimize the sum of the expected value of the total discounted flow of capital injections needed to avoid bankruptcy and of the fixed activation cost of the reinsurance contract. We provide an explicit solution to this problem, which involves the resolution of a static nonlinear optimization problem and of an optimal stopping problem for a reflected diffusion. We then illustrate the theoretical results in the case of proportional and excess-of-loss reinsurance, by providing a numerical study of the dependency of the optimal solution with respect to the model's parameters.
△ Less
Submitted 11 February, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Machine Learning-Assisted Recurrence Prediction for Early-Stage Non-Small-Cell Lung Cancer Patients
Authors:
Adrianna Janik,
Maria Torrente,
Luca Costabello,
Virginia Calvo,
Brian Walsh,
Carlos Camps,
Sameh K. Mohamed,
Ana L. Ortega,
Vít Nováček,
Bartomeu Massutí,
Pasquale Minervini,
M. Rosario Garcia Campelo,
Edel del Barco,
Joaquim Bosch-Barrera,
Ernestina Menasalvas,
Mohan Timilsina,
Mariano Provencio
Abstract:
Background: Stratifying cancer patients according to risk of relapse can personalize their care. In this work, we provide an answer to the following research question: How to utilize machine learning to estimate probability of relapse in early-stage non-small-cell lung cancer patients?
Methods: For predicting relapse in 1,387 early-stage (I-II), non-small-cell lung cancer (NSCLC) patients from t…
▽ More
Background: Stratifying cancer patients according to risk of relapse can personalize their care. In this work, we provide an answer to the following research question: How to utilize machine learning to estimate probability of relapse in early-stage non-small-cell lung cancer patients?
Methods: For predicting relapse in 1,387 early-stage (I-II), non-small-cell lung cancer (NSCLC) patients from the Spanish Lung Cancer Group data (65.7 average age, 24.8% females, 75.2% males) we train tabular and graph machine learning models. We generate automatic explanations for the predictions of such models. For models trained on tabular data, we adopt SHAP local explanations to gauge how each patient feature contributes to the predicted outcome. We explain graph machine learning predictions with an example-based method that highlights influential past patients. Results: Machine learning models trained on tabular data exhibit a 76% accuracy for the Random Forest model at predicting relapse evaluated with a 10-fold cross-validation (model was trained 10 times with different independent sets of patients in test, train and validation sets, the reported metrics are averaged over these 10 test sets). Graph machine learning reaches 68% accuracy over a 200-patient, held-out test set, calibrated on a held-out set of 100 patients. Conclusions: Our results show that machine learning models trained on tabular and graph data can enable objective, personalised and reproducible prediction of relapse and therefore, disease outcome in patients with early-stage NSCLC. With further prospective and multisite validation, and additional radiological and molecular data, this prognostic model could potentially serve as a predictive decision support tool for deciding the use of adjuvant treatments in early-stage lung cancer. Keywords: Non-Small-Cell Lung Cancer, Tumor Recurrence Prediction, Machine Learning
△ Less
Submitted 17 November, 2022;
originally announced November 2022.
-
Optimal Vaccination in a SIRS Epidemic Model
Authors:
Salvatore Federico,
Giorgio Ferrari,
Maria-Laura Torrente
Abstract:
We propose and solve an optimal vaccination problem within a deterministic compartmental model of SIRS type: the immunized population can become susceptible again, e.g.\ because of a not complete immunization power of the vaccine. A social planner thus aims at reducing the number of susceptible individuals via a vaccination campaign, while minimizing the social and economic costs related to the in…
▽ More
We propose and solve an optimal vaccination problem within a deterministic compartmental model of SIRS type: the immunized population can become susceptible again, e.g.\ because of a not complete immunization power of the vaccine. A social planner thus aims at reducing the number of susceptible individuals via a vaccination campaign, while minimizing the social and economic costs related to the infectious disease. As a theoretical contribution, we provide a technical non-smooth verification theorem, guaranteeing that a semiconcave viscosity solution to the Hamilton-Jacobi-Bellman equation identifies with the minimal cost function, provided that the closed-loop equation admits a solution. Conditions under which the closed-loop equation is well-posed are then derived by borrowing results from the theory of \emph{Regular Lagrangian Flows}. From the applied point of view, we provide a numerical implementation of the model in a case study with quadratic instantaneous costs. Amongst other conclusions, we observe that in the long-run the optimal vaccination policy is able to keep the percentage of infected to zero, at least when the natural reproduction number and the reinfection rate are small.
△ Less
Submitted 7 June, 2022;
originally announced June 2022.
-
Quartic monoid surfaces with maximum number of lines
Authors:
Mauro Carlo Beltrametti,
Alessandro Logar,
Maria Laura Torrente
Abstract:
In 1884 the German mathematician Karl Rohn published a substantial paper on \cite{ROH} on the properties of quartic surfaces with triple points, proving (among many other things) that the maximum number of lines contained in a quartic monoid surface is $31$. In this paper we study in details this class of surfaces. We prove that there exists an open subset $A \subseteq \mathbb{P}^1_K$ ($K$ is a ch…
▽ More
In 1884 the German mathematician Karl Rohn published a substantial paper on \cite{ROH} on the properties of quartic surfaces with triple points, proving (among many other things) that the maximum number of lines contained in a quartic monoid surface is $31$. In this paper we study in details this class of surfaces. We prove that there exists an open subset $A \subseteq \mathbb{P}^1_K$ ($K$ is a characteristic zero field) that parametrizes (up to a projectivity) all the quartic monoid surfaces with $31$ lines; then we study the action of $\mathrm{PGL}(4,K)$ on these surfaces, we show that the stabiliser of each of them is a group isomorphic to $S_3$ except for one surface of the family, whose stabiliser is a group isomorphic to $S_3 \times C_3$. Finally we show that the $j$-invariant allows one to decide, also in this situation, when two elements of $A$ give the same surface up to a projectivity. To get our results, several computational tools, available in computer algebra systems, are used.
△ Less
Submitted 17 December, 2019;
originally announced December 2019.
-
Geometry of the Hough transforms with applications to synthetic data
Authors:
Mauro C. Beltrametti,
Cristina Campi,
Anna Maria Massone,
Maria-Laura Torrente
Abstract:
In the framework of the Hough transform technique to detect curves in images, we provide a bound for the number of Hough transforms to be considered for a successful optimization of the accumulator function in the recognition algorithm. Such a bound is consequence of geometrical arguments. We also show the robustness of the results when applied to synthetic datasets strongly perturbed by noise. An…
▽ More
In the framework of the Hough transform technique to detect curves in images, we provide a bound for the number of Hough transforms to be considered for a successful optimization of the accumulator function in the recognition algorithm. Such a bound is consequence of geometrical arguments. We also show the robustness of the results when applied to synthetic datasets strongly perturbed by noise. An algebraic approach, discussed in the appendix, leads to a better bound of theoretical interest in the exact case.
△ Less
Submitted 4 April, 2019;
originally announced April 2019.
-
IASIS and BigMedilytics: Towards personalized medicine in Europe
Authors:
Ernestina Menasalvas Ruiz,
Alejandro Rodríguez-González,
Consuelo Gonzalo Martín,
Massimiliano Zanin,
Juan Manuel Tuñas,
Mariano Provencio,
Maria Torrente,
Fabio Franco,
Virginia Calvo,
Beatriz Nuñez
Abstract:
One field of application of Big Data and Artificial Intelligence that is receiving increasing attention is the biomedical domain. The huge volume of data that is customary generated by hospitals and pharmaceutical companies all over the world could potentially enable a plethora of new applications. Yet, due to the complexity of such data, this comes at a high cost. We here review the activities of…
▽ More
One field of application of Big Data and Artificial Intelligence that is receiving increasing attention is the biomedical domain. The huge volume of data that is customary generated by hospitals and pharmaceutical companies all over the world could potentially enable a plethora of new applications. Yet, due to the complexity of such data, this comes at a high cost. We here review the activities of the research group composed by people of the Universidad Politécnica de Madrid and the Hospital Universitario Puerta de Hierro de Majadahonda, Spain; discuss their activities within two European projects, IASIS and BigMedilytics; and present some initial results.
△ Less
Submitted 20 September, 2018;
originally announced September 2018.
-
Recognition of feature curves on 3D shapes using an algebraic approach to Hough transforms
Authors:
Maria-Laura Torrente,
Silvia Biasotti,
Bianca Falcidieno
Abstract:
Feature curves are largely adopted to highlight shape features, such as sharp lines, or to divide surfaces into meaningful segments, like convex or concave regions. Extracting these curves is not sufficient to convey prominent and meaningful information about a shape. We have first to separate the curves belonging to features from those caused by noise and then to select the lines, which describe…
▽ More
Feature curves are largely adopted to highlight shape features, such as sharp lines, or to divide surfaces into meaningful segments, like convex or concave regions. Extracting these curves is not sufficient to convey prominent and meaningful information about a shape. We have first to separate the curves belonging to features from those caused by noise and then to select the lines, which describe non-trivial portions of a surface. The automatic detection of such features is crucial for the identification and/or annotation of relevant parts of a given shape. To do this, the Hough transform (HT) is a feature extraction technique widely used in image analysis, computer vision and digital image processing, while, for 3D shapes, the extraction of salient feature curves is still an open problem.
Thanks to algebraic geometry concepts, the HT technique has been recently extended to include a vast class of algebraic curves, thus proving to be a competitive tool for yielding an explicit representation of the diverse feature lines equations. In the paper, for the first time we apply this novel extension of the HT technique to the realm of 3D shapes in order to identify and localize semantic features like patterns, decorations or anatomical details on 3D objects (both complete and fragments), even in the case of features partially damaged or incomplete. The method recognizes various features, possibly compound, and it selects the most suitable feature profiles among families of algebraic curves.
△ Less
Submitted 28 September, 2017;
originally announced September 2017.
-
Corrigendum for "Almost vanishing polynomials and an application to the Hough transform"
Authors:
Maria-Laura Torrente,
Mauro C. Beltrametti
Abstract:
In this note we correct a technical error occurred in [M. Torrente and M.C. Beltrametti, "Almost vanishing polynomials and an application to the Hough transform", J. Algebra Appl. 13(8), (2014)]. This affects the bounds given in that paper, even though the structure and the logic of all proofs remain fully unchanged.
In this note we correct a technical error occurred in [M. Torrente and M.C. Beltrametti, "Almost vanishing polynomials and an application to the Hough transform", J. Algebra Appl. 13(8), (2014)]. This affects the bounds given in that paper, even though the structure and the logic of all proofs remain fully unchanged.
△ Less
Submitted 29 April, 2017;
originally announced May 2017.
-
Approximation by Spline Curves: towards an Application to Cognitive Neuroscience
Authors:
Maria-Laura Torrente,
Stefano Anzellotti,
Chiara Finocchiaro,
Claudio Fontanari
Abstract:
We present a procedure to approximate a plane contour by piecewise polynomial functions, depending on various parameters, such as degree, number of local patches, selection of knots. This procedure aims to be adopted to study how information about shape is represented.
We present a procedure to approximate a plane contour by piecewise polynomial functions, depending on various parameters, such as degree, number of local patches, selection of knots. This procedure aims to be adopted to study how information about shape is represented.
△ Less
Submitted 14 July, 2015;
originally announced July 2015.
-
An alternative algorithm for computing the Betti table of a monomial ideal
Authors:
Maria-Laura Torrente,
Matteo Varbaro
Abstract:
In this paper we develop a new technique to compute the Betti table of a monomial ideal. We present a prototype implementation of the resulting algorithm and we perform numerical experiments suggesting a very promising efficiency. On the way of describing the method, we also prove new constraints on the shape of the possible Betti tables of a monomial ideal.
In this paper we develop a new technique to compute the Betti table of a monomial ideal. We present a prototype implementation of the resulting algorithm and we perform numerical experiments suggesting a very promising efficiency. On the way of describing the method, we also prove new constraints on the shape of the possible Betti tables of a monomial ideal.
△ Less
Submitted 28 July, 2015; v1 submitted 5 July, 2015;
originally announced July 2015.
-
Stable Complete Intersections
Authors:
Lorenzo Robbiano,
Maria Laura Torrente
Abstract:
A complete intersection of n polynomials in n indeterminates has only a finite number of zeros. In this paper we address the following question: how do the zeros change when the coefficients of the polynomials are perturbed? In the first part we show how to construct semi-algebraic sets in the parameter space over which all the complete intersection ideals share the same number of isolated real ze…
▽ More
A complete intersection of n polynomials in n indeterminates has only a finite number of zeros. In this paper we address the following question: how do the zeros change when the coefficients of the polynomials are perturbed? In the first part we show how to construct semi-algebraic sets in the parameter space over which all the complete intersection ideals share the same number of isolated real zeros. In the second part we show how to modify the complete intersection and get a new one which generates the same ideal but whose real zeros are more stable with respect to perturbations of the coefficients.
△ Less
Submitted 10 February, 2011;
originally announced February 2011.
-
Simple Approximate Varieties for Sets of Empirical Points
Authors:
Claudia Fassino,
Maria-Laura Torrente
Abstract:
We present a symbolic-numeric approach for the analysis of a given set of noisy data, represented as a finite set $\X$ of limited precision points. Starting from $\X$ and a permitted tolerance $\varepsilon$ on its coordinates, our method automatically determines a low degree monic polynomial whose associated variety passes close to each point of $\X$ by less than the given tolerance $\varepsilon$.
We present a symbolic-numeric approach for the analysis of a given set of noisy data, represented as a finite set $\X$ of limited precision points. Starting from $\X$ and a permitted tolerance $\varepsilon$ on its coordinates, our method automatically determines a low degree monic polynomial whose associated variety passes close to each point of $\X$ by less than the given tolerance $\varepsilon$.
△ Less
Submitted 17 February, 2012; v1 submitted 2 August, 2010;
originally announced August 2010.
-
Stable Border Bases for Ideals of Points
Authors:
John Abbott,
Claudia Fassino,
Maria-Laura Torrente
Abstract:
Let $X$ be a set of points whose coordinates are known with limited accuracy; our aim is to give a characterization of the vanishing ideal $I(X)$ independent of the data uncertainty. We present a method to compute a polynomial basis $B$ of $I(X)$ which exhibits structural stability, that is, if $\widetilde X$ is any set of points differing only slightly from $X$, there exists a polynomial set…
▽ More
Let $X$ be a set of points whose coordinates are known with limited accuracy; our aim is to give a characterization of the vanishing ideal $I(X)$ independent of the data uncertainty. We present a method to compute a polynomial basis $B$ of $I(X)$ which exhibits structural stability, that is, if $\widetilde X$ is any set of points differing only slightly from $X$, there exists a polynomial set $\widetilde B$ structurally similar to $B$, which is a basis of the perturbed ideal $ I(\widetilde X)$.
△ Less
Submitted 16 October, 2007; v1 submitted 15 June, 2007;
originally announced June 2007.
-
Thinning out redundant empirical data
Authors:
John Abbott,
Claudia Fassino,
Maria-Laura Torrente
Abstract:
Given a set $X$ of "empirical" points, whose coordinates are perturbed by errors, we analyze whether it contains redundant information, that is whether some of its elements could be represented by a single equivalent point. If this is the case, the empirical information associated to $X$ could be described by fewer points, chosen in a suitable way. We present two different methods to reduce the…
▽ More
Given a set $X$ of "empirical" points, whose coordinates are perturbed by errors, we analyze whether it contains redundant information, that is whether some of its elements could be represented by a single equivalent point. If this is the case, the empirical information associated to $X$ could be described by fewer points, chosen in a suitable way. We present two different methods to reduce the cardinality of $X$ which compute a new set of points equivalent to the original one, that is representing the same empirical information. Though our algorithms use some basic notions of Cluster Analysis they are specifically designed for "thinning out" redundant data. We include some experimental results which illustrate the practical effectiveness of our methods.
△ Less
Submitted 12 February, 2007;
originally announced February 2007.