Search | arXiv e-print repository

Showing 1–48 of 48 results for author: Grant, C

Searching in archive cs.
  1. arXiv:2407.00698  [pdf]

    cs.LG cs.AI econ.GN math.NA

    NourishNet: Proactive Severity State Forecasting of Food Commodity Prices for Global Warning Systems

    Authors: Sydney Balboni, Grace Ivey, Brett Storoe, John Cisler, Tyge Plater, Caitlyn Grant, Ella Bruce, Benjamin Paulson

    Abstract: Price volatility in global food commodities is a critical signal indicating potential disruptions in the food market. Understanding forthcoming changes in these prices is essential for bolstering food security, particularly for nations at risk. The Food and Agriculture Organization of the United Nations (FAO) previously developed sophisticated statistical frameworks for the proactive prediction of…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: MICS 2024 1st Place Paper, MSOE AI-Club Research Group

  2. arXiv:2403.14074  [pdf, other]

    cs.IR cs.CL cs.LG

    M3: A Multi-Task Mixed-Objective Learning Framework for Open-Domain Multi-Hop Dense Sentence Retrieval

    Authors: Yang Bai, Anthony Colas, Christan Grant, Daisy Zhe Wang

    Abstract: In recent research, contrastive learning has proven to be a highly effective method for representation learning and is widely used for dense retrieval. However, we identify that relying solely on contrastive learning can lead to suboptimal retrieval performance. On the other hand, despite many retrieval datasets supporting various learning objectives beyond contrastive learning, combining them eff…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  3. arXiv:2312.14372  [pdf, other]

    physics.data-an cs.LG

    Generative Models for Simulation of KamLAND-Zen

    Authors: Z. Fu, C. Grant, D. M. Krawiec, A. Li, L. Winslow

    Abstract: The next generation of searches for neutrinoless double beta decay (0νββ) are poised to answer deep questions on the nature of neutrinos and the source of the Universe's matter-antimatter asymmetry. They will be looking for event rates of less than one event per ton of instrumented isotope per year. To claim discovery, accurate and efficient simulations of detector events that mimic 0ν…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Submitted to EPJC

  4. arXiv:2312.14211  [pdf, ps, other]

    cs.CL astro-ph.IM cs.AI

    Experimenting with Large Language Models and vector embeddings in NASA SciX

    Authors: Sergi Blanco-Cuaresma, Ioana Ciucă, Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Kelly E. Lockhart, Felix Grezes, Thomas Allen, Golnaz Shapurian, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Daniel Chivvis, Fernanda de Macedo Alves, Jean-Claude Paquin, Jennifer Bartlett, Mugdha Polimera, Stephanie Jarmak

    Abstract: Open-source Large Language Models enable projects such as NASA SciX (i.e., NASA ADS) to think out of the box and try alternative approaches for information retrieval and data augmentation, while respecting data copyright and users' privacy. However, when large language models are directly prompted with questions without any context, they are prone to hallucination. At NASA SciX we have developed a…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: To appear in the proceedings of the 33rd annual international Astronomical Data Analysis Software & Systems (ADASS XXXIII)

  5. arXiv:2311.13816  [pdf, other]

    cs.LG cs.AI cs.CY

    Algorithmic Fairness Generalization under Covariate and Dependence Shifts Simultaneously

    Authors: Chen Zhao, Kai Jiang, Xintao Wu, Haoliang Wang, Latifur Khan, Christan Grant, Feng Chen

    Abstract: The endeavor to preserve the generalization of a fair and invariant classifier across domains, especially in the presence of distribution shifts, becomes a significant and intricate challenge in machine learning. In response to this challenge, numerous effective algorithms have been developed with a focus on addressing the problem of fairness-aware domain generalization. These algorithms are desig…

    Submitted 21 May, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: Accepted by KDD 2024 research track

  6. arXiv:2306.01007  [pdf, other]

    cs.LG cs.AI

    Towards Fair Disentangled Online Learning for Changing Environments

    Authors: Chen Zhao, Feng Mi, Xintao Wu, Kai Jiang, Latifur Khan, Christan Grant, Feng Chen

    Abstract: In the problem of online learning for changing environments, data are sequentially received one after another over time, and their distribution assumptions may vary frequently. Although existing methods demonstrate the effectiveness of their learning algorithms by providing a tight bound on either dynamic regret or adaptive regret, most of them completely ignore learning with model fairness, defin…

    Submitted 16 July, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted by KDD 2023

  7. arXiv:2305.14553  [pdf]

    cs.CR cs.AI cs.CY

    Adversarial Machine Learning and Cybersecurity: Risks, Challenges, and Legal Implications

    Authors: Micah Musser, Andrew Lohn, James X. Dempsey, Jonathan Spring, Ram Shankar Siva Kumar, Brenda Leong, Christina Liaghati, Cindy Martinez, Crystal D. Grant, Daniel Rohrer, Heather Frase, Jonathan Elliott, John Bansemer, Mikel Rodriguez, Mitt Regan, Rumman Chowdhury, Stefan Hermanek

    Abstract: In July 2022, the Center for Security and Emerging Technology (CSET) at Georgetown University and the Program on Geopolitics, Technology, and Governance at the Stanford Cyber Policy Center convened a workshop of experts to examine the relationship between vulnerabilities in artificial intelligence systems and more traditional types of software vulnerabilities. Topics discussed included the extent…

    Submitted 23 May, 2023; originally announced May 2023.

  8. arXiv:2212.00744  [pdf, ps, other]

    cs.CL astro-ph.IM

    Improving astroBERT using Semantic Textual Similarity

    Authors: Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Shinyi Chen, Jennifer Koch, Taylor Jacovich, Pavlos Protopapas

    Abstract: The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we: - announce the first…

    Submitted 29 November, 2022; originally announced December 2022.

  9. arXiv:2203.01870  [pdf, other]

    physics.ins-det cs.LG

    KamNet: An Integrated Spatiotemporal Deep Neural Network for Rare Event Search in KamLAND-Zen

    Authors: A. Li, Z. Fu, L. Winslow, C. Grant, H. Song, H. Ozaki, I. Shimizu, A. Takeuchi

    Abstract: Rare event searches allow us to search for new physics at energy scales inaccessible with other means by leveraging specialized large-mass detectors. Machine learning provides a new tool to maximize the information provided by these detectors. The information is sparse, which forces these algorithms to start from the lowest level data and exploit all symmetries in the detector to produce results.…

    Submitted 26 July, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 12 pages, dual submission with upcoming KamLAND-Zen 800 main result

  10. arXiv:2202.00777  [pdf, ps, other]

    cs.HC astro-ph.IM

    Web accessibility trends and implementation in dynamic web applications

    Authors: Timothy W. Hostetler, Shinyi Chen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Donna M. Thompson, Roman Chyla, Golnaz Shapurian, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Stephen McDonald, Felix Grezes

    Abstract: The NASA Astrophysics Data System (ADS), a critical research service for the astrophysics community, strives to provide the most accessible and inclusive environment for the discovery and exploration of the astronomical literature. Part of this goal involves creating a digital platform that can accommodate everybody, including those with disabilities that would benefit from alternative ways to pre…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to ADASS XXXI (2021)

  11. Lex Rosetta: Transfer of Predictive Models Across Languages, Jurisdictions, and Legal Domains

    Authors: Jaromir Savelka, Hannes Westermann, Karim Benyekhlef, Charlotte S. Alexander, Jayla C. Grant, David Restrepo Amariles, Rajaa El Hamdani, Sébastien Meeùs, Michał Araszkiewicz, Kevin D. Ashley, Alexandra Ashley, Karl Branting, Mattia Falduti, Matthias Grabmair, Jakub Harašta, Tereza Novotná, Elizabeth Tippett, Shiwanni Johnson

    Abstract: In this paper, we examine the use of multi-lingual sentence embeddings to transfer predictive models for functional segmentation of adjudicatory decisions across jurisdictions, legal systems (common and civil law), languages, and domains (i.e. contexts). Mechanisms for utilizing linguistic resources outside of their original context have significant potential benefits in AI & Law because differenc…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: 10 pages

    Journal ref: In Proceedings of ICAIL 2021, pp. 129-138. 2021

  12. arXiv:2112.00590  [pdf, ps, other]

    cs.CL astro-ph.IM

    Building astroBERT, a language model for Astronomy & Astrophysics

    Authors: Felix Grezes, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Nemanja Martinovic, Shinyi Chen, Chris Tanner, Pavlos Protopapas

    Abstract: The existing search tools for exploring the NASA Astrophysics Data System (ADS) can be quite rich and empowering (e.g., similar and trending operators), but researchers are not yet allowed to fully leverage semantic search. For example, a query for "results from the Planck mission" should be able to distinguish between all the various meanings of Planck (person, mission, constant, institutions and…

    Submitted 1 December, 2021; originally announced December 2021.

  13. arXiv:2111.03984  [pdf, other]

    cs.CY cs.LG

    Proposing an Interactive Audit Pipeline for Visual Privacy Research

    Authors: Jasmine DeHart, Chenguang Xu, Lisa Egede, Christan Grant

    Abstract: In an ideal world, deployed machine learning models will enhance our society. We hope that those models will provide unbiased and ethical decisions that will benefit everyone. However, this is not always the case; issues arise during the data preparation process throughout the steps leading to the models' deployment. The continued use of biased datasets and processes will adversely damage communit…

    Submitted 23 November, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: Extended version of IEEE BigData 2021 Short Paper, 14 pages, grammar edits

  14. arXiv:2106.02318  [pdf, other]

    cs.CL

    AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding

    Authors: Jun Yan, Nasser Zalmout, Yan Liang, Christan Grant, Xiang Ren, Xin Luna Dong

    Abstract: Automatic extraction of product attribute values is an important enabling technology in e-Commerce platforms. This task is usually modeled using sequence labeling architectures, with several extensions to handle multi-attribute extraction. One line of previous work constructs attribute-specific models, through separate decoders or entirely separate models. However, this approach constrains knowled…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL-IJCNLP 2021

  15. arXiv:2009.05048  [pdf, ps, other]

    cs.SE astro-ph.IM

    Agile methodologies in teams with highly creative and autonomous members

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi

    Abstract: The Agile manifesto encourages us to value individuals and interactions over processes and tools, while Scrum, the most adopted Agile development methodology, is essentially based on roles, events, artifacts, and the rules that bind them together (i.e., processes). Moreover, it is generally proclaimed that whenever a Scrum project does not succeed, the reason is because Scrum was not implemented c…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: To appear in the proceedings of the 29th annual international Astronomical Data Analysis Software & Systems (ADASS XXIX)

  16. arXiv:1907.06234  [pdf, other]

    astro-ph.IM cs.DL

    Robust Archives Maximize Scientific Accessibility

    Authors: J. E. G. Peek, Vandana Desai, Richard L. White, Raffaele D'Abrusco, Joseph M. Mazzarella, Carolyn Grant, Jenny L. Novacescu, Elena Scire, Sherry Winkelman

    Abstract: We present a bibliographic analysis of Chandra, Hubble, and Spitzer publications. We find (a) archival data are used in >60% of the publication output and (b) archives for these missions enable a much broader set of institutions and countries to scientifically use data from these missions. Specifically, we find that authors from institutions that have published few papers from a given mission publ…

    Submitted 14 July, 2019; originally announced July 2019.

    Comments: White Paper submitted to the NAS call for Astro2020 Decadal Survey APC papers

  17. arXiv:1907.00146  [pdf, other]

    cs.DB cs.HC

    DataPop: Knowledge Base Population using Distributed Voice Enabled Devices

    Authors: Elena Montes, Monique Shotande, Daniel Helm, Christan Grant

    Abstract: Data scientists are constantly creating methods to efficiently and accurately populate big data sets for use in large-scale applications. Many recent efforts utilize crowd-sourcing and textual interfaces. In this paper, we propose a new method of curating data; namely, creating a multi-device Amazon Alexa Skill in the form of a research trivia game. Users experience a synchronized gaming experienc…

    Submitted 29 June, 2019; originally announced July 2019.

    Comments: 7 pages, 2 references, unsubmitted

  18. arXiv:1905.05633  [pdf, other]

    cs.NI

    Using Delay Tolerant Networks as a Backbone for Low-cost Smart Cities

    Authors: Oluwashina Madamori, Esther Max-Onakpoya, Christan Grant, Corey E. Baker

    Abstract: Rapid urbanization burdens city infrastructure and creates the need for local governments to maximize the usage of resources to serve its citizens. Smart city projects aim to alleviate the urbanization problem by deploying a vast amount of Internet-of-things (IoT) devices to monitor and manage environmental conditions and infrastructure. However, smart city projects can be extremely expensive to d…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: 3 pages, accepted to IEEE SmartComp 2019

  19. arXiv:1901.05463  [pdf, ps, other]

    astro-ph.IM cs.DL

    Fundamentals of effective cloud management for the new NASA Astrophysics Data System

    Authors: Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Stephen McDonald, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton, Kelly E. Lockhart, Kris Bukovi, Nathan Rapport

    Abstract: The new NASA Astrophysics Data System (ADS) is designed with a service-oriented architecture (SOA) that consists of multiple customized Apache Solr search engine instances plus a collection of microservices, containerized using Docker, and deployed in Amazon Web Services (AWS). For complex systems, like the ADS, this loosely coupled architecture can lead to a more scalable, reliable and resilient s…

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: To appear in the proceedings of the 28th annual international Astronomical Data Analysis Software & Systems (ADASS XXVIII)

  20. arXiv:1806.08471  [pdf, other]

    cs.CY

    Visual Content Privacy Leaks on Social Media Networks

    Authors: Jasmine DeHart, Christan Grant

    Abstract: With the growth and accessibility of mobile devices and internet, the ease of posting and sharing content on social media networks (SMNs) has increased exponentially. Many users post images that contain "privacy leaks" regarding themselves or someone else. Privacy leaks include any instance in which a transfer of personal identifying visual content is shared on SMNs. Private visual content (images…

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: 2 pages, 3 figures, IEEE Security and Privacy Conference, Poster

    ACM Class: K.6.5; K.6.5; I.4.9

  21. arXiv:1712.00715  [pdf, other]

    cs.HC

    Formalizing Interruptible Algorithms for Human over-the-loop Analytics

    Authors: Austin Graham, Yan Liang, Le Gruenwald, Christan Grant

    Abstract: Traditional data mining algorithms are exceptional at seeing patterns in data that humans cannot, but are often confused by details that are obvious to the organic eye. Algorithms that include humans "in-the-loop" have proved beneficial for accuracy by allowing a user to provide direction in these situations, but the slowness of human interactions causes execution times to increase exponentially.…

    Submitted 3 December, 2017; originally announced December 2017.

    Comments: 6 pages, 3 figures, Human-Machine Collaboration in Big Data

  22. arXiv:1710.11048  [pdf]

    cs.CY

    Demographics in Social Media Data for Public Health Research: Does it matter?

    Authors: Nina Cesare, Christan Grant, Jared B. Hawkins, John S. Brownstein, Elaine O. Nsoesie

    Abstract: Social media data provides propitious opportunities for public health research. However, studies suggest that disparities may exist in the representation of certain populations (e.g., people of lower socioeconomic status). To quantify and address these disparities in population representation, we need demographic information, which is usually missing from most social media platforms. Here, we prop…

    Submitted 6 November, 2017; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: Presented at the Data For Good Exchange 2017

  23. New ADS Functionality for the Curator

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Steven McDonald, Taylor J. Shaulis, Sergi Blanco-Cuaresma, Golnaz Shapurian, Timothy W. Hostetler, Matthew R. Templeton

    Abstract: In this paper we provide an update concerning the operations of the NASA Astrophysics Data System (ADS), its services and user interface, and the content currently indexed in its database. As the primary information system used by researchers in Astronomy, the ADS aims to provide a comprehensive index of all scholarly resources appearing in the literature. With the current effort in our community…

    Submitted 23 October, 2017; originally announced October 2017.

    Comments: Submitted to the Proceedings of Library and Information Services in Astronomy VIII, Strasbourg, France

  24. arXiv:1704.01218  [pdf, other]

    cs.DS cs.DB

    Storing complex data sharing policies with the Min Mask Sketch

    Authors: Stephen Smart, Christan Grant

    Abstract: More data is currently being collected and shared by software applications than ever before. In many cases, the user is asked if either all or none of their data can be shared. We hypothesize that in some cases, users would like to share data in more complex ways. In order to implement the sharing of data using more complicated privacy preferences, complex data sharing policies must be used. These…

    Submitted 4 April, 2017; originally announced April 2017.

    Comments: 8 pages, 14 figures

  25. arXiv:1702.01807  [pdf]

    cs.SI cs.CY

    How well can machine learning predict demographics of social media users?

    Authors: Nina Cesare, Christan Grant, Quynh Nguyen, Hedwig Lee, Elaine O. Nsoesie

    Abstract: The wide use of social media sites and other digital technologies have resulted in an unprecedented availability of digital data that are being used to study human behavior across research domains. Although unsolicited opinions and sentiments are available on these platforms, demographic details are usually missing. Demographic information is pertinent in fields such as demography and public healt…

    Submitted 30 May, 2018; v1 submitted 6 February, 2017; originally announced February 2017.

    Comments: 24 pages, 3 figures

  26. arXiv:1601.07858  [pdf, ps, other]

    astro-ph.IM cs.DL

    Aggregation and Linking of Observational Metadata in the ADS

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Carolyn S. Grant, Donna M. Thompson, Roman Chyla, Alexandra Holachek, Jonathan Elliott

    Abstract: We discuss current efforts behind the curation of observing proposals, archive bibliographies, and data links in the NASA Astrophysics Data System (ADS). The primary data in the ADS is the bibliographic content from scholarly articles in Astronomy and Physics, which ADS aggregates from publishers, arXiv and conference proceeding sites. This core bibliographic information is then further enriched b…

    Submitted 28 January, 2016; originally announced January 2016.

    Comments: 4 pages, Proceedings of the ADASS XXV conference

  27. arXiv:1508.03116  [pdf, other]

    cs.DB

    Query-Driven Sampling for Collective Entity Resolution

    Authors: Christan Grant, Daisy Zhe Wang, Michael L. Wick

    Abstract: Probabilistic databases play a preeminent role in the processing and management of uncertain data. Recently, many database research efforts have integrated probabilistic models into databases to support tasks such as information extraction and labeling. Many of these efforts are based on batch oriented inference which inhibits a realtime workflow. One important task is entity resolution (ER). ER i…

    Submitted 13 August, 2015; originally announced August 2015.

  28. arXiv:1503.05881  [pdf, other]

    cs.DL

    ADS 2.0: new architecture, API and services

    Authors: Roman Chyla, Alberto Accomazzi, Alexandra Holachek, Carolyn S. Grant, Jonathan Elliott, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray, Vladimir Sudilovsky

    Abstract: The ADS platform is undergoing the biggest rewrite of its 20-year history. While several components have been added to its architecture over the past couple of years, this talk will concentrate on the underpinnings of ADS's search layer and its API. To illustrate the design of the components in the new system, we will show how the new ADS user interface is built exclusively on top of the API using…

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: ADASS Conference 2014

  29. arXiv:1503.04194  [pdf, other]

    astro-ph.IM cs.DL

    ADS: The Next Generation Search Platform

    Authors: Alberto Accomazzi, Michael J. Kurtz, Edwin A. Henneken, Roman Chyla, James Luker, Carolyn S. Grant, Donna M. Thompson, Alexandra Holachek, Rahul Dave, Stephen S. Murray

    Abstract: Four years after the last LISA meeting, the NASA Astrophysics Data System (ADS) finds itself in the middle of major changes to the infrastructure and contents of its database. In this paper we highlight a number of features of great importance to librarians and discuss the additional functionality that we are currently developing. Starting in 2011, the ADS started to systematically collect, parse…

    Submitted 13 March, 2015; originally announced March 2015.

    Comments: Submitted to Library and Information Services in Astronomy VII, Naples, Italy

  30. arXiv:1406.4542  [pdf, ps, other]

    cs.DL astro-ph.IM

    Computing and Using Metrics in the ADS

    Authors: Edwin A. Henneken, Alberto Accomazzi, Michael J. Kurtz, Carolyn S. Grant, Donna Thompson, Jay Luker, Roman Chyla, Alexandra Holachek, Stephen S. Murray

    Abstract: Finding measures for research impact, be it for individuals, institutions, instruments or projects, has gained a lot of popularity. More papers than ever are being written on new impact measures, and problems with existing measures are being pointed out on a regular basis. Funding agencies require impact statistics in their reports, job candidates incorporate them in their resumes, and publication…

    Submitted 17 June, 2014; originally announced June 2014.

    Comments: to appear in proceedings of LISA VII conference, Naples, Italy

  31. Finding Your Literature Match -- A Recommender System

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn Grant, Donna Thompson, Elizabeth Bohlen, Giovanni Di Milia, Jay Luker, Stephen S. Murray

    Abstract: The universe of potentially interesting, searchable literature is expanding continuously. Besides the normal expansion, there is an additional influx of literature because of interdisciplinary boundaries becoming more and more diffuse. Hence, the need for accurate, efficient and intelligent search tools is bigger than ever. Even with a sophisticated search engine, looking for information can still…

    Submitted 13 May, 2010; originally announced May 2010.

    Comments: Contribution to the proceedings of the colloquium Future Professional Communication in Astronomy II, 13-14 April 2010, Cambridge, Massachusetts. 11 pages, 4 figures.

  32. arXiv:0912.5235  [pdf, ps, other]

    astro-ph.IM cs.DL cs.IR physics.soc-ph

    Using Multipartite Graphs for Recommendation and Discovery

    Authors: Michael J. Kurtz, Alberto Accomazzi, Edwin Henneken, Giovanni Di Milia, Carolyn S. Grant

    Abstract: The Smithsonian/NASA Astrophysics Data System exists at the nexus of a dense system of interacting and interlinked information networks. The syntactic and the semantic content of this multipartite graph structure can be combined to provide very specific research recommendations to the scientist/user.

    Submitted 30 December, 2009; originally announced December 2009.

    Comments: To appear in ADASS XIX, ASP Conf Proc

  33. arXiv:0909.4789  [pdf]

    cs.DL physics.soc-ph

    The Bibliometric Properties of Article Readership Information

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Markus Demleitner, Stephen S. Murray, Nathalie Martimbeau, Barbara Elwell

    Abstract: The NASA Astrophysics Data System (ADS), along with astronomy's journals and data centers (a collaboration dubbed URANIA), has developed a distributed on-line digital library which has become the dominant means by which astronomers search, access and read their technical literature. Digital libraries such as the NASA Astrophysics Data System permit the easy accumulation of a new type of bibliome…

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56..111K This is the second paper (the first is Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library) from the original article The NASA Astrophysics Data System: Sociology, Bibliometrics, and Impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 111 (2005)

  34. arXiv:0909.4786  [pdf]

    cs.DL physics.soc-ph

    Worldwide Use and Impact of the NASA Astrophysics Data System Digital Library

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Markus Demleitner, Stephen S. Murray

    Abstract: By combining data from the text, citation, and reference databases with data from the ADS readership logs we have been able to create Second Order Bibliometric Operators, a customizable class of collaborative filters which permits substantially improved accuracy in literature queries. Using the ADS usage logs along with membership statistics from the International Astronomical Union and data o…

    Submitted 25 September, 2009; originally announced September 2009.

    Comments: ADS bibcode: 2005JASIS..56...36K This is a portion (The bibliometric properties of article readership information is the other part) of the article: The NASA Astrophysics Data System: Sociology, bibliometrics and impact, which went on-line in the summer of 2003

    Journal ref: The Journal of the American Society for Information Science and Technology, Vol. 56, p. 36. (2005)

  35. Use of Astronomical Literature - A Report on Usage Patterns

    Authors: Edwin A. Henneken, Michael J. Kurtz, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: In this paper we present a number of metrics for usage of the SAO/NASA Astrophysics Data System (ADS). Since the ADS is used by the entire astronomical community, these are indicative of how the astronomical literature is used. We will show how the use of the ADS has changed both quantitatively and qualitatively. We will also show that different types of users access the system in different ways…

    Submitted 3 October, 2008; v1 submitted 1 August, 2008; originally announced August 2008.

    Comments: 12 pages, 8 figures, 2 tables. Accepted by Journal of Informetrics

  36. arXiv:cs/0701035  [pdf, ps, other]

    cs.DL astro-ph

    Finding Astronomical Communities Through Co-readership Analysis

    Authors: Edwin A. Henneken, Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Whenever a large group of people are engaged in an activity, communities will form. The nature of these communities depends on the relationship considered. In the group of people who regularly use scholarly literature, a relationship like "person i and person j have cited the same paper" might reveal communities of people working in a particular field. On this poster, we will investigate the r…

    Submitted 5 January, 2007; originally announced January 2007.

    Comments: poster presented at the 209th AAS Meeting, 7 pages, 4 figures

  37. arXiv:cs/0610030  [pdf, ps, other]

    cs.DL cs.HC

    Paper to Screen: Processing Historical Scans in the ADS

    Authors: Donna M. Thompson, Alberto Accomazzi, Guenther Eichhorn, Carolyn Grant, Edwin Henneken, Michael J. Kurtz, Elizabeth Bohlen, Stephen S. Murray

    Abstract: The NASA Astrophysics Data System in conjunction with the Wolbach Library at the Harvard-Smithsonian Center for Astrophysics is working on a project to microfilm historical observatory publications. The microfilm is then scanned for inclusion in the ADS. The ADS currently contains over 700,000 scanned pages of volumes of historical literature. Many of these volumes lack clear pagination or other…

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of Library and Information Services in Astronomy; to be published in the ASP Conference Series

  38. arXiv:cs/0610029  [pdf, ps, other]

    cs.DL cs.DB

    Data in the ADS -- Understanding How to Use it Better

    Authors: Carolyn S. Grant, Alberto Accomazzi, Donna Thompson, Edwin Henneken, Guenther Eichhorn, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Smithsonian/NASA ADS Abstract Service contains a wealth of data for astronomers and librarians alike, yet the vast majority of usage consists of rudimentary searches. Hints on how to obtain more focused search results by using more of the various capabilities of the ADS are presented, including searching by affiliation. We also discuss the classification of articles by content and by referee…

    Submitted 5 October, 2006; originally announced October 2006.

    Comments: 4 pages; submitted to the proceedings of the Library and Information Services in Astronomy V; to be published by ASP Conference Proceedings

  39. arXiv:cs/0610011  [pdf, ps, other]

    cs.DL astro-ph cs.DB cs.IR

    Creation and use of Citations in the ADS

    Authors: Alberto Accomazzi, Gunther Eichhorn, Michael J. Kurtz, Carolyn S. Grant, Edwin Henneken, Markus Demleitner, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: With over 20 million records, the ADS citation database is regularly used by researchers and librarians to measure the scientific impact of individuals, groups, and institutions. In addition to the traditional sources of citations, the ADS has recently added references extracted from the arXiv e-prints on a nightly basis. We review the procedures used to harvest and identify the reference data u…

    Submitted 3 October, 2006; originally announced October 2006.

    Comments: 9 pages; to be published in the proceedings of the conference "Library and Information Services V," June 2006, Cambridge, MA, USA

  40. arXiv:cs/0610008  [pdf, ps, other]

    cs.DL astro-ph cs.DB

    Connectivity in the Astronomy Digital Library

    Authors: Günther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Astrophysics Data System (ADS) provides an extensive system of links between the literature and other on-line information. Recently, the journals of the American Astronomical Society (AAS) and a group of NASA data centers have collaborated to provide more links between on-line data obtained by space missions and the on-line journals. Authors can now specify which data sets they have used in…

    Submitted 2 October, 2006; originally announced October 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  41. arXiv:cs/0610007  [pdf, ps, other]

    cs.DL astro-ph cs.DB

    Full Text Searching in the Astrophysics Data System

    Authors: Günther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Edwin A. Henneken, Donna M. Thompson, Michael J. Kurtz, Stephen S. Murray

    Abstract: The Smithsonian/NASA Astrophysics Data System (ADS) provides a search system for the astronomy and physics scholarly literature. All major and many smaller astronomy journals that were published on paper have been scanned back to volume 1 and are available through the ADS free of charge. All scanned pages have been converted to text and can be searched through the ADS Full Text Search System. In…

    Submitted 5 October, 2006; v1 submitted 2 October, 2006; originally announced October 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  42. E-prints and Journal Articles in Astronomy: a Productive Co-existence

    Authors: Edwin A. Henneken, Michael J. Kurtz, Simeon Warner, Paul Ginsparg, Guenther Eichhorn, Alberto Accomazzi, Carolyn S. Grant, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Are the e-prints (electronic preprints) from the arXiv repository being used instead of the journal articles? In this paper we show that the e-prints have not undermined the usage of journal papers in the astrophysics community. As soon as the journal article is published, the astronomical community prefers to read the journal article and the use of e-prints through the NASA Astrophysics Data Sy…

    Submitted 22 September, 2006; originally announced September 2006.

    Comments: 8 pages, 4 figures, submitted to Learned Publishing

    Journal ref: Learn.Publ.20:16-22,2007

  43. arXiv:astro-ph/0609794  [pdf, ps, other]

    astro-ph cs.DL

    The Future of Technical Libraries

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Edwin Henneken, Donna Thompson, Elizabeth Bohlen, Stephen S. Murray

    Abstract: Technical libraries are currently experiencing very rapid change. In the near future their mission will change, their physical nature will change, and the skills of their employees will change. While some will not be able to make these changes, and will fail, others will lead us into a new era.

    Submitted 28 September, 2006; originally announced September 2006.

    Comments: To appear in Library and Information Systems in Astronomy V

  44. arXiv:cs/0608027  [pdf, ps, other]

    cs.DL astro-ph

    myADS-arXiv - a Tailor-Made, Open Access, Virtual Journal

    Authors: E. Henneken, M. J. Kurtz, G. Eichhorn, A. Accomazzi, C. S. Grant, D. Thompson, E. Bohlen, S. S. Murray

    Abstract: The myADS-arXiv service provides the scientific community with a one stop shop for staying up-to-date with a researcher's field of interest. The service provides a powerful and unique filter on the enormous amount of bibliographic information added to the ADS on a daily basis. It also provides a complete view with the most relevant papers available in the subscriber's field of interest. With thi…

    Submitted 4 August, 2006; originally announced August 2006.

    Comments: 4 pages, 2 figures, poster paper to appear in the proceedings of the LISA V conference

  45. arXiv:cs/0604061  [pdf]

    cs.DL astro-ph

    Effect of E-printing on Citation Rates in Astronomy and Physics

    Authors: Edwin A. Henneken, Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Donna Thompson, Stephen S. Murray

    Abstract: In this report we examine the change in citation behavior since the introduction of the arXiv e-print repository (Ginsparg, 2001). It has been observed that papers that initially appear as arXiv e-prints get cited more than papers that do not (Lawrence, 2001; Brody et al., 2004; Schwarz & Kennicutt, 2004; Kurtz et al., 2005a, Metcalfe, 2005). Using the citation statistics from the NASA-Smithsoni…

    Submitted 5 June, 2006; v1 submitted 13 April, 2006; originally announced April 2006.

    Comments: Submitted to the Journal of Electronic Publishing. 11 pages with 5 figures

  46. arXiv:cs/0511002  [pdf, ps, other]

    cs.IR cs.DL

    Bibliographic Classification using the ADS Databases

    Authors: Alberto Accomazzi, Michael J. Kurtz, Guenther Eichhorn, Edwin Henneken, Carolyn S. Grant, Markus Demleitner, Stephen S. Murray

    Abstract: We discuss two techniques used to characterize bibliographic records based on their similarity to and relationship with the contents of the NASA Astrophysics Data System (ADS) databases. The first method has been used to classify input text as being relevant to one or more subject areas based on an analysis of the frequency distribution of its individual words. The second method has been used to…

    Submitted 31 October, 2005; originally announced November 2005.

    Comments: Latex, 4 pages, 1 Figure. To be published in the Proceedings of the Conference "Astronomical Data Analysis Software & Systems XV" held October 2-5, 2005, in San Lorenzo de El Escorial, Spain

  47. The Effect of Use and Access on Citations

    Authors: Michael J. Kurtz, Guenther Eichhorn, Alberto Accomazzi, Carolyn Grant, Markus Demleitner, Edwin Henneken, Stephen S. Murray

    Abstract: It has been shown (S. Lawrence, 2001, Nature, 411, 521) that journal articles which have been posted without charge on the internet are more heavily cited than those which have not been. Using data from the NASA Astrophysics Data System (ads.harvard.edu) and from the ArXiv e-print archive at Cornell University (arXiv.org) we examine the causes of this effect.

    Submitted 14 March, 2005; originally announced March 2005.

    Comments: Accepted for publication in Information Processing & Management, special issue on scientometrics

    ACM Class: H.3.7

    Journal ref: Inform Process Manag 41:1395-1402 (2005)

  48. arXiv:cs/0401028  [pdf, ps, other]

    cs.DL

    Automated Resolution of Noisy Bibliographic References

    Authors: Markus Demleitner, Michael Kurtz, Alberto Accomazzi, Günther Eichhorn, Carolyn S. Grant, Steven S. Murray

    Abstract: We describe a system used by the NASA Astrophysics Data System to identify bibliographic references obtained from scanned article pages by OCR methods with records in a bibliographic database. We analyze the process generating the noisy references and conclude that the three-step procedure of correcting the OCR results, parsing the corrected string and matching it against the database provides u…

    Submitted 27 January, 2004; originally announced January 2004.

    Comments: 10 pages, 1 figure; accepted for publication in the proceedings of the 2004 Meeting of the International Federation of Classification Societies

    ACM Class: H.3.7; H.3.2