-
Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval
Authors:
Kazuya Ueki
Abstract:
Visual-semantic embedding is an interesting research topic because it is useful for various tasks, such as visual question answering (VQA), image-text retrieval, image captioning, and scene graph generation. In this paper, we focus on zero-shot image retrieval using sentences as queries and present a survey of the technological trends in this area. First, we provide a comprehensive overview of the history of the technology, starting with a discussion of the early studies of image-to-text matching and how the technology has evolved over time. In addition, a description of the datasets commonly used in experiments and a comparison of the evaluation results of each method are presented. We also introduce implementations available on GitHub that can be used to confirm the accuracy of experiments and to make further improvements. We hope that this survey paper will encourage researchers to further develop their research on bridging images and languages.
Submitted 28 September, 2021; v1 submitted 16 May, 2021;
originally announced May 2021.
-
Survey on Deep Learning-based Kuzushiji Recognition
Authors:
Kazuya Ueki,
Tomoka Kojima
Abstract:
Owing to the overwhelming accuracy of the deep learning method demonstrated at the 2012 image classification competition, deep learning has been successfully applied to a variety of other tasks. The high-precision detection and recognition of Kuzushiji, a Japanese cursive script used for transcribing historical documents, has been made possible through the use of deep learning. In recent years, competitions on Kuzushiji recognition have been held, and many researchers have proposed various recognition methods. This study examines recent research trends, current problems, and future prospects in Kuzushiji recognition using deep learning.
Submitted 19 July, 2020;
originally announced July 2020.
-
Stochastic Modeling of 3-D Compositional Distribution in the Crust with Bayesian Inference and Application to Geoneutrino Observation in Japan
Authors:
Nozomu Takeuchi,
Kenta Ueki,
Tsuyoshi Iizuka,
Jun Nagao,
Akiko Tanaka,
Sanshiro Enomoto,
Yutaka Shirahata,
Hiroko Watanabe,
Makoto Yamano,
Hiroyuki K. M. Tanaka
Abstract:
Geoneutrino observations, first achieved by KamLAND in 2005 and followed by Borexino in 2010, have accumulated statistics and improved sensitivity for more than ten years. The uncertainty of the geoneutrino flux at the surface is now reduced to a level small enough to set useful constraints on U and Th abundances in the bulk silicate Earth (BSE). However, in order to make inferences on Earth's compositional model, the contributions from the local crust need to be understood within a similar uncertainty. Here we develop a new method to construct a stochastic crustal composition model utilizing Bayesian inference. While the methodology has general applicability, it incorporates all the local uniqueness in its probabilistic framework. Unlike common approaches for this type of problem, our method does not depend on crustal segmentation into upper, (middle) and lower, whose classification and boundaries are not always well defined. We also develop a new modeling method to infer rock composition distributions that conserve mass balance and therefore do not bias the results. Combined with a vast new collection of geochemical data for rock samples in the Japan arc, we apply this method to geoneutrino observation at Kamioka, Japan. Currently, a difficulty remains in the handling of correlations in the flux integration; we conservatively assume maximum correlation, which leads to large flux estimation errors of 60-70%. Despite the large errors, this is the first local crustal model for geoneutrino flux prediction with probabilistic error estimation in a reproducible way.
Submitted 4 January, 2019;
originally announced January 2019.
-
Geochemical discrimination and characteristics of magmatic tectonic settings; a machine learning-based approach
Authors:
Kenta Ueki,
Hideitsu Hino,
Tatsu Kuwatani
Abstract:
Geochemically discriminating between magmatism in different tectonic settings remains a fundamental part of understanding the processes of magma generation within the Earth's mantle. Here, we present an approach where machine-learning (ML) methods are used for quantitative tectonic discrimination and feature selection using global geochemical datasets containing data for volcanic rocks generated in eight different tectonic settings. This study uses support vector machine, random forest, and sparse multinomial regression (SMR) approaches. All these ML methods with data for 20 elements and 5 isotopic ratios allowed the successful geochemical discrimination between igneous rocks formed in eight different tectonic settings with a discriminant ratio better than 83% for all settings barring oceanic plateaus and back-arc basins. SMR is a particularly powerful and interpretable ML method because it quantitatively identifies geochemical signatures that characterize the tectonic settings of interest and the characteristics of each sample as a probability of the membership of the sample for each setting. We also present the most representative basalt composition for each tectonic setting. The new data provide reference points for future geochemical discussions. Our results indicate that at least 17 elements and isotopic ratios are required to characterize each tectonic setting, suggesting that geochemical tectonic discrimination cannot be achieved using only a small number of elemental compositions and/or isotopic ratios. The results show that volcanic rocks formed in different tectonic settings have unique geochemical signatures, indicating that both volcanic rock geochemistry and magma generation processes are closely connected to the tectonic setting.
Submitted 4 April, 2018; v1 submitted 25 December, 2017;
originally announced December 2017.