
Aequitas Flow: Streamlining Fair ML Experimentation

Authors: Sérgio Jesus, Pedro Saleiro, Inês Oliveira e Silva, Beatriz M. Jorge, Rita P. Ribeiro, João Gama, Pedro Bizarro, Rayid Ghani

Abstract: Aequitas Flow is an open-source framework for end-to-end Fair Machine Learning (ML) experimentation in Python. This package fills the existing integration gaps in other Fair ML packages of complete and accessible experimentation. It provides a pipeline for fairness-aware model training, hyperparameter optimization, and evaluation, enabling rapid and simple experiments and result analysis. Aimed at ML practitioners and researchers, the framework offers implementations of methods, datasets, metrics, and standard interfaces for these components to improve extensibility. By facilitating the development of fair ML practices, Aequitas Flow seeks to enhance the adoption of these concepts in AI technologies.

Submitted 9 May, 2024; originally announced May 2024.

arXiv:2210.06376 [pdf, other]

Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded Vocabulary

Authors: Daniel Loureiro, Alípio Mário Jorge

Abstract: Progress on commonsense reasoning is usually measured from performance improvements on Question Answering tasks designed to require commonsense knowledge. However, fine-tuning large Language Models (LMs) on these specific tasks does not directly evaluate commonsense learned during pre-training. The most direct assessments of commonsense knowledge in pre-trained LMs are arguably cloze-style tasks targeting commonsense assertions (e.g., A pen is used for [MASK].). However, this approach is restricted by the LM's vocabulary available for masked predictions, and its precision is subject to the context provided by the assertion. In this work, we present a method for enriching LMs with a grounded sense inventory (i.e., WordNet) available at the vocabulary level, without further training. This modification augments the prediction space of cloze-style prompts to the size of a large ontology while enabling finer-grained (sense-level) queries and predictions. In order to evaluate LMs with higher precision, we propose SenseLAMA, a cloze-style task featuring verbalized relations from disambiguated triples sourced from WordNet, WikiData, and ConceptNet. Applying our method to BERT, producing a WordNet-enriched version named SynBERT, we find that LMs can learn non-trivial commonsense knowledge from self-supervision, covering numerous relations, and more effectively than comparable similarity-based approaches.

Submitted 12 October, 2022; originally announced October 2022.

arXiv:2202.13920 [pdf, other]

doi 10.1051/0004-6361/202142738

Forming planets around stars with non-solar elemental composition

Authors: D. M. Jorge, I. E. E. Kamp, L. B. F. M. Waters, P. Woitke, R. J. Spaargaren

Abstract: Stars in the solar neighbourhood have refractory element ratios slightly different from the Sun. It is unclear how much the condensation of solids and thus the composition of planets forming around these stars is affected. We aim to understand the impact of changing the ratios of refractory elements Mg, Si, and Fe within the range observed in solar type stars within 150~pc on the composition of planets forming around them. We use the GGchem code to simulate the condensation of solids in protoplanetary disks with a Minimum Mass Solar Nebula around main sequence G-type stars in the Solar neighbourhood. We extract the stellar elemental composition from the Hypatia database. We find that a lower Mg/Si ratio shifts the condensation sequence from forsterite (Mg$_2$SiO$_4$) and SiO to enstatite (MgSiO$_3$) and quartz (SiO$_2$); a lower Fe/S ratio leads to the formation of FeS and FeS$_2$ and little or no Fe-bearing silicates. Ratios of refractory elements translate directly from the gas phase to the condensed phase for $T\,<\,1000$~K. However, ratios with respect to volatile elements (e.g.\ oxygen and sulphur) in the condensates -- the building blocks of planets -- differ from the original stellar composition. Our study shows that the composition of planets crucially depends on the abundances of the stellar system under investigation. Our results can have important implications for planet interiors, which depend strongly on the degree of oxidation and the sulphur abundance.

Submitted 28 February, 2022; originally announced February 2022.

Comments: Accepted for publication in A&A

arXiv:2201.05156

Proceedings of the 4th Workshop on Online Recommender Systems and User Modeling -- ORSUM 2021

Authors: João Vinagre, Alípio Mário Jorge, Marie Al-Ghossein, Albert Bifet

Abstract: Modern online services continuously generate data at very fast rates. This continuous flow of data encompasses content - e.g., posts, news, products, comments -, but also user feedback - e.g., ratings, views, reads, clicks -, together with context data - user device, spatial or temporal data, user task or activity, weather. This can be overwhelming for systems and algorithms designed to train in batches, given the continuous and potentially fast change of content, context and user preferences or intents. Therefore, it is important to investigate online methods able to transparently adapt to the inherent dynamics of online services. Incremental models that learn from data streams are gaining attention in the recommender systems community, given their natural ability to deal with the continuous flows of data generated in dynamic, complex environments. User modeling and personalization can particularly benefit from algorithms capable of maintaining models incrementally and online. The objective of this workshop is to foster contributions and bring together a growing community of researchers and practitioners interested in online, adaptive approaches to user modeling, recommendation and personalization, and their implications regarding multiple dimensions, such as evaluation, reproducibility, privacy and explainability.

Submitted 17 January, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

arXiv:2109.14740 [pdf, other]

doi 10.21711/231766362022/rmc498

On the principal eigenvalue of the truncated Laplacian, and submanifolds with bounded mean curvature

Authors: Gregório Pacelli F. Bessa, Luquésio Petrola de M. Jorge, Luciano Mari

Abstract: In this paper, we study the principal eigenvalue $\mu(\mathscr{F}_k^-,E)$ of the fully nonlinear operator \[ \mathscr{F}_k^-[u] = \mathcal{P}_k^-(\nabla^2 u) - h |\nabla u| \] on a set $E \Subset \mathbb{R}^n$, where $h \in [0,\infty)$ and $\mathcal{P}_k^-(\nabla^2 u)$ is the sum of the smallest $k$ eigenvalues of the Hessian $\nabla^2 u$. We prove a lower estimate for $\mu(\mathscr{F}_k^-,E)$ in terms of a generalized Hausdorff measure $\mathscr{H}_\Psi(E)$, for suitable $\Psi$ depending on $k$, moving some steps in the direction of the conjecturally sharp estimate \[ \mu(\mathscr{F}_k^-,E) \ge C \mathscr{H}^k(E)^{-2/k}. \] The theorem is used to study the spectrum of bounded submanifolds in $\mathbb{R}^n$, improving on our previous work in the direction of a question posed by S.T. Yau. In particular, the result applies to solutions of Plateau's problem for CMC surfaces.

Submitted 1 January, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

Comments: 17 pages, typos corrected. Accepted in Matemática Contemporânea, volume in honor of Renato Tribuzy for his 75th birthday. Package axessibility included to make the paper available to visually impaired people

Journal ref: Mat. Contemp. 49 (2022), Special issue in honor of Professor Renato de Azevedo Tribuzy on the occasion of his 75th birthday, 212-235

arXiv:2105.12449 [pdf, other]

doi 10.1016/j.artint.2022.103661

LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond

Authors: Daniel Loureiro, Alípio Mário Jorge, Jose Camacho-Collados

Abstract: Distributional semantics based on neural approaches is a cornerstone of Natural Language Processing, with surprising connections to human meaning representation as well. Recent Transformer-based Language Models have proven capable of producing contextual word representations that reliably convey sense-specific information, simply as a product of self-supervision. Prior work has shown that these contextual representations can be used to accurately represent large sense inventories as sense embeddings, to the extent that a distance-based solution to Word Sense Disambiguation (WSD) tasks outperforms models trained specifically for the task. Still, there remains much to understand on how to use these Neural Language Models (NLMs) to produce sense embeddings that can better harness each NLM's meaning representation abilities. In this work we introduce a more principled approach to leverage information from all layers of NLMs, informed by a probing analysis on 14 NLM variants. We also emphasize the versatility of these sense embeddings in contrast to task-specific models, applying them on several sense-related tasks, besides WSD, while demonstrating improved performance using our proposed approach over prior work focused on sense embeddings. Finally, we discuss unexpected findings regarding layer and model performance variations, and potential applications for downstream tasks.

Submitted 1 April, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

Comments: Accepted to Artificial Intelligence Journal (AIJ)

Journal ref: Artificial Intelligence Volume 305, April 2022, 103661

arXiv:1903.08504 [pdf, other]

doi 10.1016/j.inffus.2017.07.001

Preference rules for label ranking: Mining patterns in multi-target relations

Authors: Cláudio Rebelo de Sá, Paulo Azevedo, Carlos Soares, Alípio Mário Jorge, Arno Knobbe

Abstract: In this paper we investigate two variants of association rules for preference data, Label Ranking Association Rules and Pairwise Association Rules. Label Ranking Association Rules (LRAR) are the equivalent of Class Association Rules (CAR) for the Label Ranking task. In CAR, the consequent is a single class, to which the example is expected to belong to. In LRAR, the consequent is a ranking of the labels. The generation of LRAR requires special support and confidence measures to assess the similarity of rankings. In this work, we carry out a sensitivity analysis of these similarity-based measures. We want to understand which datasets benefit more from such measures and which parameters have more influence in the accuracy of the model. Furthermore, we propose an alternative type of rules, the Pairwise Association Rules (PAR), which are defined as association rules with a set of pairwise preferences in the consequent. While PAR can be used both as descriptive and predictive models, they are essentially descriptive models. Experimental results show the potential of both approaches.

Submitted 20 March, 2019; originally announced March 2019.

Journal ref: Information Fusion, Volume 40, March 2018, Pages 112-125

arXiv:1809.00589 [pdf, other]

Affordance Extraction and Inference based on Semantic Role Labeling

Authors: Daniel Loureiro, Alípio Mário Jorge

Abstract: Common-sense reasoning is becoming increasingly important for the advancement of Natural Language Processing. While word embeddings have been very successful, they cannot explain which aspects of 'coffee' and 'tea' make them similar, or how they could be related to 'shop'. In this paper, we propose an explicit word representation that builds upon the Distributional Hypothesis to represent meaning from semantic roles, and allow inference of relations from their meshing, as supported by the affordance-based Indexical Hypothesis. We find that our model improves the state-of-the-art on unsupervised word similarity tasks while allowing for direct inference of new relations from the same vector space.

Submitted 3 September, 2018; originally announced September 2018.

Comments: Accepted at FEVER - EMNLP 2018

arXiv:1806.05891 [pdf, other]

doi 10.1007/JHEP01(2019)027

Electroluminescence TPCs at the thermal diffusion limit

Authors: C. A. O. Henriques, C. M. B. Monteiro, D. González-Díaz, C. D. R. Azevedo, E. D. C. Freitas, R. D. P. Mano, M. R. Jorge, A. F. M. Fernandes, J. J. Gómez-Cadenas, L. M. P. Fernandes, C. Adams, V. Álvarez, L. Arazi, K. Bailey, F. Ballester, J. M. Benlloch-Rodríguez, F. I. G. M. Borges, A. Botas, S. Cárcel, J. V. Carrión, S. Cebrián, C. A. N. Conde, J. Díaz, M. Diesburg, J. Escada, et al. (56 additional authors not shown)

Abstract: The NEXT experiment aims at searching for the hypothetical neutrinoless double-beta decay from the ${}^{136}$Xe isotope using a high-purity xenon TPC. Efficient discrimination of the events through pattern recognition of the topology of primary ionisation tracks is a major requirement for the experiment. However, it is limited by the diffusion of electrons. It is known that the addition of a small fraction of a molecular gas to xenon reduces electron diffusion. On the other hand, the electroluminescence (EL) yield drops and the achievable energy resolution may be compromised. We have studied the effect of adding several molecular gases to xenon (CO${}_{2}$, CH${}_{4}$ and CF${}_{4}$) on the EL yield and energy resolution obtained in a small prototype of driftless gas proportional scintillation counter. We have compared our results on the scintillation characteristics (EL yield and energy resolution) with a microscopic simulation, obtaining the diffusion coefficients in those conditions as well. Accordingly, electron diffusion may be reduced from about 10 mm/$\sqrt{\mathrm{m}}$ for pure xenon down to 2.5 mm/$\sqrt{\mathrm{m}}$ using additive concentrations of about 0.05%, 0.2% and 0.02% for CO${}_{2}$, CH${}_{4}$ and CF${}_{4}$, respectively. Our results show that CF${}_{4}$ admixtures present the highest EL yield in those conditions, but very poor energy resolution as a result of huge fluctuations observed in the EL formation. CH${}_{4}$ presents the best energy resolution despite the EL yield being the lowest.
The results obtained with xenon admixtures are extrapolated to the operational conditions of the NEXT-100 TPC. CO${}_{2}$ and CH${}_{4}$ show potential as molecular additives in a large xenon TPC, CH${}_{4}$ showing the best performance and stability to be used in the NEXT-100 TPC, with an extrapolated energy resolution of 0.4% at 2.45 MeV for concentrations below 0.4%.

Submitted 30 October, 2018; v1 submitted 15 June, 2018; originally announced June 2018.

Comments: 22 pages, 8 figures

MSC Class: 85-05

arXiv:1705.06345 [pdf, ps, other]

An Overview of Data Mining Applications in Oil and Gas Exploration: Structural Geology and Reservoir Property-Issues

Authors: Hamed Nikhalat Jahromi, Alpio M. Jorge

Abstract: Low oil prices have motivated energy executives to look into cost reduction in their supply chains more seriously. To this end, a new technology that is experimentally considered in hydrocarbon exploration is data mining. There are two major categories of geoscientific problems in which data mining is applied: structural geology and reservoir property-issues. This research overviews these categories by considering a variety of interesting works in each of them. The result is an understanding of the specific geoscientific problems studied in the literature, along with the relative data mining methods. This way, this work tries to lay the ground for a mutual understanding on oil and gas exploration between the data miners and the geoscientists.

Submitted 12 May, 2017; originally announced May 2017.

Comments: Part of DM4OG 2017 proceedings (arXiv:1705.03451)

arXiv:1704.01623 [pdf]

Secondary scintillation yield of Xenon with sub-percent levels of CO2 additive: efficiently reducing electron diffusion in HPXe optical TPCs for rare-event detection

Authors: C. A. O. Henriques, E. D. C. Freitas, C. D. R. Azevedo, D. González-Díaz, R. D. P. Mano, M. R. Jorge, L. M. P. Fernandes, C. M. B. Monteiro, J. J. Gómez-Cadenas, V. Álvarez, J. M. Benlloch-Rodríguez, F. I. G. M. Borges, A. Botas, S. Cárcel, J. V. Carrión, S. Cebrián, C. A. N. Conde, J. Díaz, M. Diesburg, J. Escada, R. Esteve, R. Felkai, P. Ferrario, A. L. Ferreira, A. Goldschmidt, et al. (45 additional authors not shown)

Abstract: We have measured the electroluminescence (EL) yield of Xe-CO2 mixtures, with sub-percent CO2 concentrations. We demonstrate that the EL production is still high in these mixtures, 70% and 35% relative to that produced in pure xenon, for CO2 concentrations around 0.05% and 0.1%, respectively. The contribution of the statistical fluctuations in EL production to the energy resolution increases with increasing CO2 concentration and, for our gas proportional scintillation counter, it is smaller than the contribution of the Fano factor for concentrations below 0.1% CO2. Xe-CO2 mixtures are important alternatives to pure xenon in TPCs based on EL signal amplification with applications in the important field of rare event detection such as directional dark matter, double electron capture and double beta decay detection. The addition of CO2 to pure xenon at the level of 0.05-0.1% can reduce significantly the scale of electron diffusion from 10 mm/sqrt(m) to 2.5 mm/sqrt(m), with high impact on the HPXe TPC discrimination efficiency of the events through pattern recognition of the topology of primary ionisation trails.

Submitted 12 April, 2017; v1 submitted 5 April, 2017; originally announced April 2017.

arXiv:1702.08177 [pdf, other]

doi 10.1038/nmat5031

Ubiquitous formation of bulk Dirac cones and topological surface states from a single orbital manifold in transition-metal dichalcogenides

Authors: M. S. Bahramy, O. J. Clark, B. -J. Yang, J. Feng, L. Bawden, J. M. Riley, I. Marković, F. Mazzola, V. Sunko, D. Biswas, S. P. Cooil, M. Jorge, J. W. Wells, M. Leandersson, T. Balasubramanian, J. Fujii, I. Vobornik, J. E. Rault, T. K. Kim, M. Hoesch, K. Okawa, M. Asakawa, T. Sasagawa, T. Eknapakul, W. Meevasana, et al. (1 additional author not shown)

Abstract: Transition-metal dichalcogenides (TMDs) are renowned for their rich and varied properties. They range from metals and superconductors to strongly spin-orbit-coupled semiconductors and charge-density-wave systems, with their single-layer variants one of the most prominent current examples of two-dimensional materials beyond graphene. Their varied ground states largely depend on the transition metal d-electron-derived electronic states, on which the vast majority of attention has been concentrated to date. Here, we focus on the chalcogen-derived states. From density-functional theory calculations together with spin- and angle- resolved photoemission, we find that these generically host type-II three-dimensional bulk Dirac fermions as well as ladders of topological surface states and surface resonances. We demonstrate how these naturally arise within a single p-orbital manifold as a general consequence of a trigonal crystal field, and as such can be expected across a large number of compounds. Already, we demonstrate their existence in six separate TMDs, opening routes to tune, and ultimately exploit, their topological physics.

Submitted 19 July, 2018; v1 submitted 27 February, 2017; originally announced February 2017.

Comments: 10 pages, 4 figures

Journal ref: Nature Materials 17, 21-28 (2018) (DOI 10.1038/nmat5031)

arXiv:1611.00558 [pdf, other]

doi 10.1007/978-3-319-65340-2_49

Improving incremental recommenders with online bagging

Authors: João Vinagre, Alípio Mário Jorge, João Gama

Abstract: Online recommender systems often deal with continuous, potentially fast and unbounded flows of data. Ensemble methods for recommender systems have been used in the past in batch algorithms, however they have never been studied with incremental algorithms that learn from data streams. We evaluate online bagging with an incremental matrix factorization algorithm for top-N recommendation with positive-only -- binary -- ratings. Our results show that online bagging is able to improve accuracy up to 35% over the baseline, with small computational overhead.

Submitted 26 March, 2018; v1 submitted 2 November, 2016; originally announced November 2016.

Comments: Submitted to EPIA 2017

Journal ref: In: Oliveira E., Gama J., Vale Z., Lopes Cardoso H. (eds) Progress in Artificial Intelligence. EPIA 2017. Lecture Notes in Computer Science, vol 10423. Springer, Cham

arXiv:1510.03116 [pdf, ps, other]

doi 10.1088/1748-0221/11/01/P01005

First in-beam studies of a Resistive-Plate WELL gaseous multiplier

Authors: S. Bressler, L. Moleri, M. Pitt, S. Kudella, D. C. R. Azevedo, F. D. Amaro, M. R. Jorge, J. M. F. dos Santos, J. F. C. A. Veloso, H. Natal da Luz, L. Arazi, E. Olivieri, A. Breskin

Abstract: We present the results of the first in-beam studies of a medium size (10$\times$10 cm$^2$) Resistive-Plate WELL (RPWELL): a single-sided THGEM coupled to a pad anode through a resistive layer of high bulk resistivity ($\sim$10$^9 \Omega$cm). The 6.2~mm thick (excluding readout electronics) single-stage detector was studied with 150~GeV muons and pions. Signals were recorded from 1$\times$1 cm$^2$ square copper pads with APV25-SRS readout electronics. The single-element detector was operated in Ne/(5% $\mathrm{CH_{4}}$) at a gas gain of a few times 10$^4$, reaching 99$\%$ detection efficiency at average pad multiplicity of $\sim$1.2. Operation at particle fluxes up to $\sim$10$^4$ Hz/cm$^2$ resulted in $\sim$23$\%$ gain drop leading to $\sim$5$\%$ efficiency loss. The striking feature was the discharge-free operation, also in intense pion beams. These results pave the way towards robust, efficient large-scale detectors for applications requiring economic solutions at moderate spatial and energy resolutions.

Submitted 20 January, 2016; v1 submitted 11 October, 2015; originally announced October 2015.

Comments: Accepted by JINST

arXiv:1504.08175 [pdf, other]

doi 10.13140/2.1.4381.5367

Evaluation of recommender systems in streaming environments

Authors: João Vinagre, Alípio Mário Jorge, João Gama

Abstract: Evaluation of recommender systems is typically done with finite datasets. This means that conventional evaluation methodologies are only applicable in offline experiments, where data and models are stationary. However, in real world systems, user feedback is continuously generated, at unpredictable rates. Given this setting, one important issue is how to evaluate algorithms in such a streaming data environment. In this paper we propose a prequential evaluation protocol for recommender systems, suitable for streaming data environments, but also applicable in stationary settings. Using this protocol we are able to monitor the evolution of algorithms' accuracy over time. Furthermore, we are able to perform reliable comparative assessments of algorithms by computing significance tests over a sliding window. We argue that besides being suitable for streaming data, prequential evaluation allows the detection of phenomena that would otherwise remain unnoticed in the evaluation of both offline and online recommender systems.

Submitted 30 April, 2015; originally announced April 2015.

Comments: Workshop on 'Recommender Systems Evaluation: Dimensions and Design' (REDD 2014), held in conjunction with RecSys 2014. October 10, 2014, Silicon Valley, United States

arXiv:1210.2981 [pdf, other]

Identifying interfacial molecules in nonplanar interfaces: the generalized ITIM algorithm

Authors: Marcello Sega, Sofia Kantorovich, Pál Jedlovszky, Miguel Jorge

Abstract: We present a generalized version of the ITIM algorithm for the identification of interfacial molecules, which is able to treat arbitrarily shaped interfaces. The algorithm exploits the similarities between the concept of probe sphere used in ITIM and the circumsphere criterion used in the alpha-shapes approach, and can be regarded either as a reference-frame independent version of the former, or as an extended version of the latter that includes the atomic excluded volume. The new algorithm is applied to compute the intrinsic orientational order parameters of water around a DPC and a cholic acid micelle in aqueous environment, and to the identification of solvent-reachable sites in four model structures for soot. The additional algorithm introduced for the calculation of intrinsic density profiles in arbitrary geometries proved to be extremely useful also for planar interfaces, as it allows to solve the paradox of smeared intrinsic profiles far from the interface.

Submitted 10 October, 2012; originally announced October 2012.

arXiv:1111.2948 [pdf, ps, other]

Using Contextual Information as Virtual Items on Top-N Recommender Systems

Authors: Marcos A. Domingues, Alipio Mario Jorge, Carlos Soares

Abstract: Traditionally, recommender systems for the Web deal with applications that have two dimensions, users and items. Based on access logs that relate these dimensions, a recommendation model can be built and used to identify a set of N items that will be of interest to a certain user. In this paper we propose a method to complement the information in the access logs with contextual information without changing the recommendation algorithm. The method consists in representing context as virtual items. We empirically test this method with two top-N recommender systems, an item-based collaborative filtering technique and association rules, on three data sets. The results show that our method is able to take advantage of the context (new dimensions) when it is informative.

Submitted 15 November, 2011; v1 submitted 12 November, 2011; originally announced November 2011.

Comments: Workshop on Context-Aware Recommender Systems (CARS'09) in conjunction with the 3rd ACM Conference on Recommender Systems (RecSys'09)

ACM Class: I.2.6

arXiv:0901.0512 [pdf]

Expected Performance of the ATLAS Experiment - Detector, Trigger and Physics

Authors: The ATLAS Collaboration, G. Aad, E. Abat, B. Abbott, J. Abdallah, A. A. Abdelalim, A. Abdesselam, O. Abdinov, B. Abi, M. Abolins, H. Abramowicz, B. S. Acharya, D. L. Adams, T. N. Addy, C. Adorisio, P. Adragna, T. Adye, J. A. Aguilar-Saavedra, M. Aharrouche, S. P. Ahlen, F. Ahles, A. Ahmad, H. Ahmed, G. Aielli, T. Akdogan, et al. (2587 additional authors not shown)

Abstract: A detailed study is presented of the expected performance of the ATLAS detector. The reconstruction of tracks, leptons, photons, missing energy and jets is investigated, together with the performance of b-tagging and the trigger. The physics potential for a variety of interesting physics processes, within the Standard Model and beyond, is examined. The study comprises a series of notes based on simulations of the detector and physics processes, with particular emphasis given to the data expected from the first years of operation of the LHC at CERN.

Submitted 14 August, 2009; v1 submitted 28 December, 2008; originally announced January 2009.

Showing 1–18 of 18 results for author: Jorge, M