-
Burst Super-Resolution with Diffusion Models for Improving Perceptual Quality
Authors:
Kyotaro Tokoro,
Kazutoshi Akita,
Norimichi Ukita
Abstract:
While burst LR images are useful for improving the SR image quality compared with a single LR image, prior SR networks accepting the burst LR images are trained in a deterministic manner, which is known to produce a blurry SR image. In addition, it is difficult to perfectly align the burst LR images, making the SR image more blurry. Since such blurry images are perceptually degraded, we aim to reconstruct the sharp high-fidelity boundaries. Such high-fidelity images can be reconstructed by diffusion models. However, prior SR methods using the diffusion model are not properly optimized for the burst SR task. Specifically, the reverse process starting from a random sample is not optimized for image enhancement and restoration methods, including burst SR. In our proposed method, on the other hand, burst LR features are used to reconstruct the initial burst SR image that is fed into an intermediate step in the diffusion model. This reverse process from the intermediate step 1) skips diffusion steps for reconstructing the global structure of the image and 2) focuses on steps for refining detailed textures. Our experimental results demonstrate that our method can improve the scores of the perceptual quality metrics. Code: https://github.com/placerkyo/BSRD
Submitted 8 April, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
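The idea in the abstract above of entering the reverse process at an intermediate step, rather than starting from pure noise, can be sketched as follows. This is a minimal illustration and not the authors' implementation: the linear beta schedule, the deterministic DDIM-style update, the choice to noise the initial estimate up to `t_start`, and the toy `make_oracle` noise predictor (a stand-in for a trained network) are all assumptions introduced here.

```python
import math
import random

def make_alpha_bars(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for a linear beta schedule (assumed)."""
    alpha_bars, prod = [], 1.0
    for i in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        prod *= 1.0 - beta
        alpha_bars.append(prod)
    return alpha_bars

def noise_to_step(x0, t, alpha_bars, rng):
    """Forward-diffuse x0 to step t: sqrt(a_t) * x0 + sqrt(1 - a_t) * eps."""
    a = alpha_bars[t]
    return [math.sqrt(a) * v + math.sqrt(1.0 - a) * rng.gauss(0.0, 1.0) for v in x0]

def reverse_from(x_t, t_start, alpha_bars, predict_eps):
    """Deterministic DDIM-style reverse process from t_start down to step 0."""
    x, steps = list(x_t), 0
    for t in range(t_start, -1, -1):
        a_t = alpha_bars[t]
        a_prev = alpha_bars[t - 1] if t > 0 else 1.0
        eps = predict_eps(x, t)
        # Estimate the clean signal, then move it to the previous noise level.
        x0_pred = [(xi - math.sqrt(1.0 - a_t) * ei) / math.sqrt(a_t)
                   for xi, ei in zip(x, eps)]
        x = [math.sqrt(a_prev) * p + math.sqrt(1.0 - a_prev) * ei
             for p, ei in zip(x0_pred, eps)]
        steps += 1
    return x, steps

def make_oracle(target, alpha_bars):
    """Hypothetical noise predictor that already knows the clean target.
    It only stands in for a trained denoising network in this toy example."""
    def predict_eps(x, t):
        a = alpha_bars[t]
        return [(xi - math.sqrt(a) * ti) / math.sqrt(1.0 - a)
                for xi, ti in zip(x, target)]
    return predict_eps

T = 1000                           # length of the full diffusion chain
t_start = 300                      # intermediate entry step (assumed value)
alpha_bars = make_alpha_bars(T)
rng = random.Random(0)
target = [0.2, -0.5, 0.9]          # toy "sharp" signal
initial_sr = [0.25, -0.4, 0.8]     # toy initial SR estimate
x_t = noise_to_step(initial_sr, t_start, alpha_bars, rng)
refined, steps = reverse_from(x_t, t_start, alpha_bars, make_oracle(target, alpha_bars))
```

In this sketch only `t_start + 1` of the `T` reverse steps are run, which is the sense in which the early, structure-forming steps are skipped.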
-
Inpainting-Driven Mask Optimization for Object Removal
Authors:
Kodai Shimosato,
Norimichi Ukita
Abstract:
This paper proposes a mask optimization method for improving the quality of object removal using image inpainting. While many inpainting methods are trained with a set of random masks, a target for inpainting may be an object, such as a person, in many realistic scenarios. This domain gap between masks in training and inference images increases the difficulty of the inpainting task. In our method, this domain gap is resolved by training the inpainting network with object masks extracted by segmentation, and such object masks are also used in the inference step. Furthermore, to optimize the object masks for inpainting, the segmentation network is connected to the inpainting network and end-to-end trained to improve the inpainting performance. The effect of this end-to-end training is further enhanced by our mask expansion loss for achieving the trade-off between large and small masks. Experimental results demonstrate the effectiveness of our method for better object removal using image inpainting.
Submitted 23 March, 2024;
originally announced March 2024.
-
Time-series Initialization and Conditioning for Video-agnostic Stabilization of Video Super-Resolution using Recurrent Networks
Authors:
Hiroshi Mori,
Norimichi Ukita
Abstract:
A Recurrent Neural Network (RNN) for Video Super Resolution (VSR) is generally trained with randomly clipped and cropped short videos extracted from original training videos due to various challenges in learning RNNs. However, since this RNN is optimized to super-resolve short videos, VSR of long videos is degraded due to the domain gap. Our preliminary experiments reveal that such degradation changes depending on the video properties, such as the video length and dynamics. To avoid this degradation, this paper proposes a training strategy for RNN-based VSR that works efficiently and stably regardless of the video length and dynamics. The proposed training strategy stabilizes VSR by training a VSR network with various RNN hidden states that change depending on the video properties. Since computing such a variety of hidden states is time-consuming, this computational cost is reduced by reusing the hidden states for efficient training. In addition, training stability is further improved with frame-number conditioning. Our experimental results demonstrate that the proposed method performs better than base methods on videos with various lengths and dynamics.
Submitted 23 March, 2024;
originally announced March 2024.
-
Depth Estimation fusing Image and Radar Measurements with Uncertain Directions
Authors:
Masaya Kotani,
Takeru Oba,
Norimichi Ukita
Abstract:
This paper proposes a depth estimation method using radar-image fusion by addressing the uncertain vertical directions of sparse radar measurements. In prior radar-image fusion work, image features are merged with the uncertain sparse depths measured by radar through convolutional layers. This approach is disturbed by the features computed with the uncertain radar depths. Furthermore, since the features are computed with a fully convolutional network, the uncertainty of each depth corresponding to a pixel is spread out over its surrounding pixels. Our method avoids this problem by computing features only with an image and conditioning the features pixelwise with the radar depth. Furthermore, the set of possibly correct radar directions is identified with reliable LiDAR measurements, which are available only in the training stage. Our method improves training data by learning only these possibly correct radar directions, while the previous method trains on raw radar measurements, including erroneous ones. Experimental results demonstrate that our method can improve the quantitative and qualitative results compared with its base method using radar-image fusion.
Submitted 23 March, 2024;
originally announced March 2024.
-
NTIRE 2023 Image Shadow Removal Challenge Technical Report: Team IIM_TTI
Authors:
Yuki Kondo,
Riku Miyata,
Fuma Yasue,
Taito Naruki,
Norimichi Ukita
Abstract:
In this paper, we analyze and discuss ShadowFormer in preparation for the NTIRE2023 Shadow Removal Challenge [1], implementing five key improvements: image alignment, the introduction of a perceptual quality loss function, semi-automatic annotation for shadow detection, joint learning of shadow detection and removal, and the introduction of a new data augmentation technique, "CutShadow", for shadow removal. Our method achieved scores of 0.196 (3rd out of 19) in LPIPS and 7.44 (4th out of 19) in the Mean Opinion Score (MOS).
Submitted 14 March, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Learning Group Activity Features Through Person Attribute Prediction
Authors:
Chihiro Nakatani,
Hiroaki Kawashima,
Norimichi Ukita
Abstract:
This paper proposes Group Activity Feature (GAF) learning in which features of multi-person activity are learned as a compact latent vector. Unlike prior work in which the manual annotation of group activities is required for supervised learning, our method learns the GAF through person attribute prediction without group activity annotations. By learning the whole network in an end-to-end manner so that the GAF is required for predicting the person attributes of people in a group, the GAF is trained as the features of multi-person activity. As a person attribute, we propose to use a person's action class and appearance features because the former is easy to annotate due to its simplicity, and the latter requires no manual annotation. In addition, we introduce a location-guided attribute prediction to disentangle the complex GAF for extracting the features of each target person properly. Various experimental results validate that our method outperforms SOTA methods quantitatively and qualitatively on two public datasets. Visualization of our GAF also demonstrates that our method learns the GAF representing fine-grained group activity classes. Code: https://github.com/chihina/GAFL-CVPR2024.
Submitted 11 March, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
$|V_{us}|$ from kaon semileptonic form factor in $N_f = 2+1$ QCD at the physical point on (10 fm)$^4$
Authors:
Takeshi Yamazaki,
Ken-ichi Ishikawa,
Naruhito Ishizuka,
Yoshinobu Kuramashi,
Yusuke Namekawa,
Yusuke Taniguchi,
Naoya Ukita,
Tomoteru Yoshié for PACS Collaboration
Abstract:
We present a preliminary result of the kaon semileptonic form factor calculated at the smallest lattice spacing in the PACS10 configurations, whose physical volumes are more than (10 fm)$^4$ at the physical point. The configurations were generated using the Iwasaki gauge action and $N_f=2+1$ stout-smeared nonperturbatively $O(a)$ improved Wilson quark action at the three lattice spacings, 0.085, 0.063, and 0.041 fm. The value of $|V_{us}|$ in the continuum limit is estimated from our results including the preliminary one. We compare our result of $|V_{us}|$ with the previous results and those through the kaon leptonic decay.
Submitted 28 November, 2023;
originally announced November 2023.
-
Active Transfer Learning for Efficient Video-Specific Human Pose Estimation
Authors:
Hiromu Taketsugu,
Norimichi Ukita
Abstract:
Human Pose (HP) estimation is actively researched because of its wide range of applications. However, even estimators pre-trained on large datasets may not perform satisfactorily due to a domain gap between the training and test data. To address this issue, we present our approach combining Active Learning (AL) and Transfer Learning (TL) to adapt HP estimators to individual video domains efficiently. For efficient learning, our approach quantifies (i) the estimation uncertainty based on the temporal changes in the estimated heatmaps and (ii) the unnaturalness in the estimated full-body HPs. These quantified criteria are then effectively combined with the state-of-the-art representativeness criterion to select uncertain and diverse samples for efficient HP estimator learning. Furthermore, we reconsider the existing Active Transfer Learning (ATL) method to introduce novel ideas related to the retraining methods and Stopping Criteria (SC). Experimental results demonstrate that our method enhances learning efficiency and outperforms comparative methods. Our code is publicly available at: https://github.com/ImIntheMiddle/VATL4Pose-WACV2024
Submitted 8 November, 2023;
originally announced November 2023.
-
Multimodal Active Measurement for Human Mesh Recovery in Close Proximity
Authors:
Takahiro Maeda,
Keisuke Takeshita,
Norimichi Ukita,
Kazuhito Tanaka
Abstract:
For physical human-robot interactions (pHRI), a robot needs to estimate the accurate body pose of a target person. However, in these pHRI scenarios, the robot cannot fully observe the target person's body with equipped cameras because the target person must be close to the robot for physical interaction. This close distance leads to severe truncation and occlusions and thus results in poor accuracy of human pose estimation. For better accuracy in this challenging environment, we propose an active measurement and sensor fusion framework of the equipped cameras with touch and ranging sensors such as 2D LiDAR. Touch and ranging sensor measurements are sparse but reliable and informative cues for localizing human body parts. In our active measurement process, camera viewpoints and sensor placements are dynamically optimized to measure body parts with higher estimation uncertainty, which is closely related to truncation or occlusion. In our sensor fusion process, assuming that the measurements of touch and ranging sensors are more reliable than the camera-based estimations, we fuse the sensor measurements to the camera-based estimated pose by aligning the estimated pose towards the measured points. Our proposed method outperformed previous methods on the standard occlusion benchmark with simulated active measurement. Furthermore, our method reliably estimated human poses using a real robot, even with practical constraints such as occlusion by blankets.
Submitted 19 July, 2024; v1 submitted 12 October, 2023;
originally announced October 2023.
-
Fast Inference and Update of Probabilistic Density Estimation on Trajectory Prediction
Authors:
Takahiro Maeda,
Norimichi Ukita
Abstract:
Safety-critical applications such as autonomous vehicles and social robots require fast computation and accurate probability density estimation on trajectory prediction. To address both requirements, this paper presents a new normalizing flow-based trajectory prediction model named FlowChain. FlowChain is a stack of conditional continuously-indexed flows (CIFs) that are expressive and allow analytical probability density computation. This analytical computation is faster than the generative models that need additional approximations such as kernel density estimation. Moreover, FlowChain is more accurate than the Gaussian mixture-based models due to fewer assumptions on the estimated density. FlowChain also allows a rapid update of estimated probability densities. This update is achieved by adopting the newest observed position and reusing the flow transformations and their log-det-Jacobians, which represent the motion trend. This update is completed in less than one millisecond because this reuse greatly reduces the computational cost. Experimental results showed our FlowChain achieved state-of-the-art trajectory prediction accuracy compared to previous methods. Furthermore, our FlowChain demonstrated superiority in the accuracy and speed of density estimation. Our code is available at https://github.com/meaten/FlowChain-ICCV2023
Submitted 17 August, 2023;
originally announced August 2023.
-
Interaction-aware Joint Attention Estimation Using People Attributes
Authors:
Chihiro Nakatani,
Hiroaki Kawashima,
Norimichi Ukita
Abstract:
This paper proposes joint attention estimation in a single image. Different from related work in which only the gaze-related attributes of people are independently employed, (i) their locations and actions are also employed as contextual cues for weighting their attributes, and (ii) interactions among all of these attributes are explicitly modeled in our method. For the interaction modeling, we propose a novel Transformer-based attention network to encode joint attention as low-dimensional features. We introduce a specialized MLP head with positional embedding to the Transformer so that it predicts pixelwise confidence of joint attention for generating the confidence heatmap. This pixelwise prediction improves the heatmap accuracy by avoiding the ill-posed problem in which the high-dimensional heatmap is predicted from the low-dimensional features. The estimated joint attention is further improved by being integrated with general image-based attention estimation. Our method outperforms SOTA methods quantitatively in comparative experiments. Code: https://anonymous.4open.science/r/anonymized_codes-ECA4.
Submitted 10 August, 2023;
originally announced August 2023.
-
MVA2023 Small Object Detection Challenge for Spotting Birds: Dataset, Methods, and Results
Authors:
Yuki Kondo,
Norimichi Ukita,
Takayuki Yamaguchi,
Hao-Yu Hou,
Mu-Yi Shen,
Chia-Chi Hsu,
En-Ming Huang,
Yu-Chen Huang,
Yu-Cheng Xia,
Chien-Yao Wang,
Chun-Yi Lee,
Da Huo,
Marc A. Kastner,
Tingwei Liu,
Yasutomo Kawanishi,
Takatsugu Hirayama,
Takahiro Komamizu,
Ichiro Ide,
Yosuke Shinya,
Xinyao Liu,
Guang Liang,
Syusuke Yasui
Abstract:
Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image appearances of small objects. This paper proposes a new SOD dataset consisting of 39,070 images including 137,121 bird instances, which is called the Small Object Detection for Spotting Birds (SOD4SB) dataset. The details of the challenge with the SOD4SB dataset are introduced in this paper. In total, 223 participants joined this challenge. This paper briefly introduces the award-winning methods. The dataset, the baseline code, and the website for evaluation on the public test set are publicly available.
Submitted 18 July, 2023;
originally announced July 2023.
-
R2-Diff: Denoising by diffusion as a refinement of retrieved motion for image-based motion prediction
Authors:
Takeru Oba,
Norimichi Ukita
Abstract:
Image-based motion prediction is one of the essential techniques for robot manipulation. Among the various prediction models, we focus on diffusion models because they have achieved state-of-the-art performance in various applications. In image-based motion prediction, diffusion models stochastically predict contextually appropriate motion by gradually denoising random Gaussian noise based on the image context. While diffusion models are able to predict various motions by changing the random noise, they sometimes fail to predict a contextually appropriate motion based on the image because the random noise is sampled independently of the image context. To solve this problem, we propose R2-Diff. In R2-Diff, a motion retrieved from a dataset based on image similarity is fed into a diffusion model instead of random noise. Then, the retrieved motion is refined through the denoising process of the diffusion model. Since the retrieved motion is almost appropriate to the context, it becomes easier to predict contextually appropriate motion. However, traditional diffusion models are not optimized to refine the retrieved motion. Therefore, we propose the method of tuning the hyperparameters based on the distance of the nearest neighbor motion among the dataset to optimize the diffusion model for refinement. Furthermore, we propose an image-based retrieval method to retrieve the nearest neighbor motion in inference. Our proposed retrieval efficiently computes the similarity based on the image features along the motion trajectory. We demonstrate that R2-Diff accurately predicts appropriate motions and achieves high task success rates compared to recent state-of-the-art models in robot manipulation.
Submitted 15 June, 2023;
originally announced June 2023.
-
Detached Eclipsing Binaries in Compact Hierarchical Triples: Triple-lined systems BD+44 2258 and KIC 06525196
Authors:
Ayush Moharana,
K. G. Hełminiak,
F. Marcadon,
T. Pawar,
M. Konacki,
N. Ukita,
E. Kambe,
H. Maehara
Abstract:
Compact Hierarchical Triples (CHT) are systems with the tertiary star orbiting the inner binary in an orbit shorter than 1000 days. A CHT with an eclipsing binary as its inner binary can help us extract a multitude of information about all three stars in the system. In this study, we use independent observational techniques to estimate the orbital, stellar, and atmospheric parameters of two triple-lined CHT: BD+44 2258 and KIC 06525196. We find that the masses of stars in BD+44 2258 are $1.011\pm0.029 M_{\odot}$, $0.941\pm0.033 M_{\odot}$, and $0.907\pm0.065 M_{\odot}$ while in KIC 06525196 the estimated masses are $1.0351\pm0.0055 M_{\odot}$, $0.9712\pm0.0039 M_{\odot}$, and $0.777\pm0.012 M_{\odot}$. Using spectral disentangling, we obtained individual spectra of all the stars and combined them with light curve modeling to obtain radii, metallicities, and temperatures. Using stellar evolution models from MESA, we constrain the log(age) to be 9.89 for BD+44 2258 and 9.49 for KIC 06525196. Two stars in BD+44 2258 are found to be sub-giants while all three stars in KIC 06525196 are main-sequence stars. We constrain the mutual inclinations to certain angles for BD+44 2258 and KIC 06525196 using numerical integration. Integrating with tidal interaction schemes and stellar evolution models, we find that KIC 06525196 is a stable system, whereas the inner binary of BD+44 2258 merges within 550 Myr. The time of this merger is affected by the orientation of the tertiary, even rushing the collapse by 100 Myr when the mutual inclination is close to 90 degrees.
Submitted 22 March, 2023; v1 submitted 9 March, 2023;
originally announced March 2023.
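For readers who prefer linear units, the log(age) values quoted in the abstract above can be converted with a one-line calculation. This sketch assumes the common convention that log(age) means the base-10 logarithm of the age in years; the abstract itself does not state the unit, and the helper name `log_age_to_gyr` is introduced here for illustration.

```python
def log_age_to_gyr(log_age):
    """Convert log10(age / yr) to an age in gigayears (assumed convention)."""
    return 10.0 ** log_age / 1e9

# Values quoted in the abstract for the two systems.
age_bd = log_age_to_gyr(9.89)   # BD+44 2258
age_kic = log_age_to_gyr(9.49)  # KIC 06525196
print(f"BD+44 2258: {age_bd:.2f} Gyr, KIC 06525196: {age_kic:.2f} Gyr")
```

Under that assumption the two systems come out at roughly 7.8 Gyr and 3.1 Gyr respectively.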
-
Joint Learning of Blind Super-Resolution and Crack Segmentation for Realistic Degraded Images
Authors:
Yuki Kondo,
Norimichi Ukita
Abstract:
This paper proposes crack segmentation augmented by super resolution (SR) with deep neural networks. In the proposed method, an SR network is jointly trained with a binary segmentation network in an end-to-end manner. This joint learning allows the SR network to be optimized for improving segmentation results. For realistic scenarios, the SR network is extended from non-blind to blind for processing a low-resolution image degraded by unknown blurs. The joint network is improved by our proposed two extra paths that further encourage the mutual optimization between SR and segmentation. Comparative experiments with state-of-the-art (SoTA) segmentation methods demonstrate the superiority of our joint learning, and various ablation studies prove the effects of our contributions.
Submitted 25 February, 2024; v1 submitted 24 February, 2023;
originally announced February 2023.
-
KS-DETR: Knowledge Sharing in Attention Learning for Detection Transformer
Authors:
Kaikai Zhao,
Norimichi Ukita
Abstract:
Scaled dot-product attention applies a softmax function on the scaled dot-product of queries and keys to calculate weights and then multiplies the weights and values. In this work, we study how to improve the learning of scaled dot-product attention to improve the accuracy of DETR. Our method is based on the following observations: using ground truth foreground-background mask (GT Fg-Bg Mask) as additional cues in the weights/values learning enables learning much better weights/values; with better weights/values, better values/weights can be learned. We propose a triple-attention module in which the first attention is a plain scaled dot-product attention, the second/third attention generates high-quality weights/values (with the assistance of GT Fg-Bg Mask) and shares the values/weights with the first attention to improve the quality of values/weights. The second and third attentions are removed during inference. We call our method knowledge-sharing DETR (KS-DETR), which is an extension of knowledge distillation (KD) in the way that the improved weights and values of the teachers (the second and third attentions) are directly shared, instead of mimicked, by the student (the first attention) to enable more efficient knowledge transfer from the teachers to the student. Experiments on various DETR-like methods show consistent improvements over the baseline methods on the MS COCO benchmark. Code is available at https://github.com/edocanonymous/KS-DETR.
Submitted 16 March, 2023; v1 submitted 22 February, 2023;
originally announced February 2023.
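The plain scaled dot-product attention that the abstract above takes as its starting point (softmax of the scaled query-key dot products, then a weighted sum of the values) can be sketched in a few lines. This is a generic pure-Python illustration of the standard operation only; it does not implement the KS-DETR triple-attention module, and the function names are introduced here.

```python
import math

def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def scaled_dot_product_attention(queries, keys, values):
    """Return (outputs, weights) where weights = softmax(Q K^T / sqrt(d_k))
    and outputs = weights @ V. Inputs are lists of equal-length float lists."""
    d_k = len(keys[0])
    weights = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights.append(softmax(scores))
    outputs = [[sum(w * v[j] for w, v in zip(row, values))
                for j in range(len(values[0]))]
               for row in weights]
    return outputs, weights

# Two identical keys receive equal weight, so the output is the mean value.
outputs, weights = scaled_dot_product_attention(
    queries=[[1.0, 0.0]],
    keys=[[1.0, 0.0], [1.0, 0.0]],
    values=[[2.0], [4.0]],
)
```

In the abstract's terminology, `weights` and `values` are the two quantities whose learning the method aims to improve.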
-
Kernelized Back-Projection Networks for Blind Super Resolution
Authors:
Tomoki Yoshida,
Yuki Kondo,
Takahiro Maeda,
Kazutoshi Akita,
Norimichi Ukita
Abstract:
Since non-blind Super Resolution (SR) fails to super-resolve Low-Resolution (LR) images degraded by arbitrary degradations, SR with the degradation model is required. However, this paper reveals that non-blind SR that is trained simply with various blur kernels exhibits performance comparable to that of methods with the degradation model for blind SR. This result motivates us to revisit high-performance non-blind SR and extend it to blind SR with blur kernels. This paper proposes two SR networks by integrating kernel estimation and SR branches in an iterative end-to-end manner. In the first model, which is called the Kernel Conditioned Back-Projection Network (KCBPN), the low-dimensional kernel representations are estimated for conditioning the SR branch. In our second model, the Kernelized Back-Projection Network (KBPN), a raw kernel is estimated and directly employed for modeling the image degradation. The estimated kernel is employed not only for back-propagating its residual but also for forward-propagating the residual to iterative stages. This forward-propagation encourages these stages to learn a variety of different features by focusing on pixels with large residuals in each stage. Experimental results validate the effectiveness of our proposed networks for kernel estimation and SR. We will release the code for this work.
Submitted 27 October, 2023; v1 submitted 16 February, 2023;
originally announced February 2023.
-
Perturbative analysis of the Wess-Zumino flow
Authors:
Daisuke Kadoh,
Kengo Kikuchi,
Naoya Ukita
Abstract:
We investigate an interacting supersymmetric gradient flow in the Wess-Zumino model. Thanks to the nonrenormalization theorem and an appropriate initial condition, we find that any correlator of flowed fields is ultraviolet finite. This is shown at all orders of the perturbation theory using the power counting theorem for one-particle irreducible supergraphs. Since the model does not have the gauge symmetry, the mechanism of realizing the ultraviolet finiteness is quite different from that of the Yang-Mills flow, and this could provide further understanding of the gradient flow approach.
Submitted 29 June, 2023; v1 submitted 14 February, 2023;
originally announced February 2023.
-
Data-Driven Stochastic Motion Evaluation and Optimization with Image by Spatially-Aligned Temporal Encoding
Authors:
Takeru Oba,
Norimichi Ukita
Abstract:
This paper proposes a probabilistic motion prediction method for long motions. The motion is predicted so that it accomplishes a task from the initial state observed in the given image. While our method evaluates the task achievability by the Energy-Based Model (EBM), previous EBMs are not designed for evaluating the consistency between different domains (i.e., image and motion in our method). Our method seamlessly integrates the image and motion data into the image feature domain by spatially-aligned temporal encoding so that features are extracted along the motion trajectory projected onto the image. Furthermore, this paper also proposes a data-driven motion optimization method, Deep Motion Optimizer (DMO), that works with EBM for motion prediction. Different from previous gradient-based optimizers, our self-supervised DMO alleviates the difficulty of hyper-parameter tuning to avoid local minima. The effectiveness of the proposed method is demonstrated with a variety of experiments with similar SOTA methods.
Submitted 9 February, 2023;
originally announced February 2023.
-
Comprehensive spectroscopic and photometric study of pulsating eclipsing binary star AI Hya
Authors:
F. Kahraman Alicavus,
T. Pawar,
K. G. Hełminiak,
G. Handler,
A. Moharana,
F. Alicavus,
P. De Cat,
F. Leone,
G. Catanzaro,
M. Giarrusso,
N. Ukita,
E. Kambe
Abstract:
The pulsating eclipsing binaries are remarkable systems that provide an opportunity to probe the stellar interior and to determine the fundamental stellar parameters precisely. Detached eclipsing binary systems with one or more pulsating components are especially significant for understanding the nature of the oscillations, since the binary effects in these systems are negligible. Recent studies based on space data have shown that the pulsation mechanisms of some oscillating stars are not completely understood. Hence, comprehensive studies of a number of pulsating stars within detached eclipsing binaries are important. In this study, we present a detailed analysis of the pulsating detached eclipsing binary system AI Hya which was studied by two independent groups with different methods. We carried out a spectroscopic survey to estimate the orbital parameters via radial velocity measurements and the atmospheric parameters of each binary component using the composite and/or disentangled spectra. We found that the more luminous component of the system is a massive, cool and chemically normal star while the hotter binary component is a slightly metal-rich object. The fundamental parameters of AI Hya were determined by the analysis of binary variations and subsequently used in the evolutionary modelling. Consequently, we obtained the age of the system as 850 $\pm$ 20 Myr and found that both binary components are situated in the Delta Scuti instability strip. The frequency analysis revealed pulsation frequencies between 5.5 and 13.0 d$^{-1}$ and we tried to estimate which binary component is the pulsating one. However, it turned out that those frequencies could originate from both binary components.
Submitted 11 January, 2023;
originally announced January 2023.
-
Momentum transfer dependence of kaon semileptonic form factor on (10 fm)$^4$ at the physical point
Authors:
Takeshi Yamazaki,
Ken-ichi Ishikawa,
Naruhito Ishizuka,
Yoshinobu Kuramashi,
Yusuke Namekawa,
Yusuke Taniguchi,
Naoya Ukita,
Tomoteru Yoshié for PACS Collaboration
Abstract:
We calculate the kaon semileptonic form factors using the two sets of the PACS10 configuration, whose physical volumes are more than (10 fm)$^4$ at the physical point. The lattice spacings are 0.063 and 0.085 fm. The configurations were generated using the Iwasaki gauge action and $N_f=2+1$ stout-smeared nonperturbatively $O(a)$-improved Wilson quark action. From the momentum transfer dependence of the form factors, we evaluate the slope and curvature for the form factors at the zero momentum transfer. Furthermore, we calculate the phase space factor, which is used to obtain $|V_{us}|$ through the kaon semileptonic decay. These results are compared with previous lattice results and experimental values.
Submitted 30 November, 2022;
originally announced December 2022.
-
Actor-identified Spatiotemporal Action Detection -- Detecting Who Is Doing What in Videos
Authors:
Fan Yang,
Norimichi Ukita,
Sakriani Sakti,
Satoshi Nakamura
Abstract:
The success of deep learning on video Action Recognition (AR) has motivated researchers to progressively promote related tasks from the coarse level to the fine-grained level. Compared with conventional AR which only predicts an action label for the entire video, Temporal Action Detection (TAD) has been investigated for estimating the start and end time for each action in videos. Taking TAD a step further, Spatiotemporal Action Detection (SAD) has been studied for localizing the action both spatially and temporally in videos. However, who performs the action is generally ignored in SAD, even though identifying the actor could also be important. To this end, we propose a novel task, Actor-identified Spatiotemporal Action Detection (ASAD), to bridge the gap between SAD and actor identification.
In ASAD, we not only detect the spatiotemporal boundary for instance-level action but also assign a unique ID to each actor. To approach ASAD, Multiple Object Tracking (MOT) and Action Classification (AC) are two fundamental elements. By using MOT, the spatiotemporal boundary of each actor is obtained and assigned to a unique actor identity. By using AC, the action class is estimated within the corresponding spatiotemporal boundary. Since ASAD is a new task, it poses many new challenges that cannot be addressed by existing methods: i) no dataset is specifically created for ASAD, ii) no evaluation metrics are designed for ASAD, iii) current MOT performance is the bottleneck to obtain satisfactory ASAD results. To address those problems, we i) annotate a new ASAD dataset, ii) propose ASAD evaluation metrics by considering multi-label actions and actor identification, and iii) improve the data association strategies in MOT to boost the MOT performance, which leads to better ASAD results. The code is available at https://github.com/fandulu/ASAD.
Submitted 7 September, 2022; v1 submitted 27 August, 2022;
originally announced August 2022.
-
Supersymmetric gradient flow in 4d N=1 SQCD
Authors:
Daisuke Kadoh,
Naoya Ukita
Abstract:
A supersymmetric gradient flow for four-dimensional N=1 supersymmetric QCD (SQCD) is proposed. The flow equation is given in both the superfield and component fields of the Wess-Zumino gauge. The superfield flow equation is defined for each of the gauge and matter multiplets individually. Adding a gauge fixing, the component-field flow equation is defined in the Wess-Zumino gauge in a gauge covariant manner. We find that the latter equation is supersymmetric in a sense that the commutator of the flow time derivative and the supersymmetry transformation vanishes up to a gauge transformation. We also discuss a simplified flow by using the gradient of supersymmetric Yang-Mills (SYM) action instead of using SQCD action to define a gauge multiplet flow.
Submitted 4 August, 2022;
originally announced August 2022.
-
$K_{\ell 3}$ form factors at the physical point: Toward the continuum limit
Authors:
Ken-ichi Ishikawa,
Naruhito Ishizuka,
Yoshinobu Kuramashi,
Yusuke Namekawa,
Yusuke Taniguchi,
Naoya Ukita,
Takeshi Yamazaki,
Tomoteru Yoshié for PACS Collaboration
Abstract:
We present updated results for the form factors of the kaon semileptonic $(K_{\ell 3})$ decay process calculated with $N_f = 2 + 1$ nonperturbatively $O(a)$-improved Wilson quark action and Iwasaki gauge action at the physical point on large volumes of more than (10 fm)$^4$. In addition to our previous calculation at the lattice spacing $a = 0.085$ fm, we perform a calculation at the second lattice spacing of $0.063$ fm. Using the results for the form factors extracted from 3-point functions with the local and also conserved vector currents at the two lattice spacings, continuum extrapolation and interpolation of the momentum transfer are carried out simultaneously to obtain the value of the form factor $f_+(0)$ at the zero momentum transfer in the continuum limit. After investigation of stability of $f_+(0)$ against several fit forms and different data, we obtain $f_+(0) = 0.9615(10)(^{+47}_{\ -3})(5)$, where the first, second, and third errors are statistical, systematic errors from choice of the fit forms and isospin breaking effect, respectively. Combining our value of $f_+(0)$ and experimental input of the $K_{\ell 3}$ decay, one of the Cabibbo-Kobayashi-Maskawa matrix elements $|V_{us}|$ is determined as $|V_{us}| = 0.2252(^{\ +5}_{-12})$, whose error contains the experimental one as well as that in the lattice calculation. This value is reasonably consistent with the ones determined from recent lattice QCD results of $f_+(0)$ and also the one determined through the kaon leptonic decay process. We observe some tension between our value and $|V_{us}|$ evaluated from the unitarity of the CKM matrix with $|V_{ud}|$, while it depends on the size of the error of $|V_{ud}|$. It is also found that $|V_{us}|$ determined with our phase space integrals through six $K_{\ell 3}$ decay processes is consistent with the above one using $f_+(0)$.
Submitted 21 November, 2022; v1 submitted 17 June, 2022;
originally announced June 2022.
-
MotionAug: Augmentation with Physical Correction for Human Motion Prediction
Authors:
Takahiro Maeda,
Norimichi Ukita
Abstract:
This paper presents a motion data augmentation scheme incorporating motion synthesis encouraging diversity and motion correction imposing physical plausibility. This motion synthesis consists of our modified Variational AutoEncoder (VAE) and Inverse Kinematics (IK). In this VAE, our proposed sampling-near-samples method generates various valid motions even with insufficient training motion data. Our IK-based motion synthesis method allows us to generate a variety of motions semi-automatically. Since these two schemes generate unrealistic artifacts in the synthesized motions, our motion correction rectifies them. This motion correction scheme consists of imitation learning with physics simulation and subsequent motion debiasing. For this imitation learning, we propose the PD-residual force that significantly accelerates the training process. Furthermore, our motion debiasing successfully offsets the motion bias induced by imitation learning to maximize the effect of augmentation. As a result, our method outperforms previous noise-based motion augmentation methods by a large margin on both Recurrent Neural Network-based and Graph Convolutional Network-based human motion prediction models. The code is available at https://github.com/meaten/MotionAug.
Submitted 17 August, 2023; v1 submitted 17 March, 2022;
originally announced March 2022.
-
A Very Metal-poor RR Lyrae Star with a Disk Orbit Found in the Solar Neighborhood
Authors:
Noriyuki Matsunaga,
Akinori Itane,
Kohei Hattori,
Juliana Crestani,
Vittorio Braga,
Giuseppe Bono,
Daisuke Taniguchi,
Junichi Baba,
Hiroyuki Maehara,
Nobuharu Ukita,
Tsuyoshi Sakamoto,
Naoto Kobayashi,
Tsutomu Aoki,
Takao Soyano,
Ken'ichi Tarusawa,
Yuki Sarugaku,
Hiroyuki Mito,
Shigeyuki Sako,
Mamoru Doi,
Yoshikazu Nakada,
Natsuko Izumi,
Yoshifusa Ita,
Hiroki Onozato,
Mingjie Jian,
Sohei Kondo
, et al. (6 additional authors not shown)
Abstract:
Metal-deficient stars are important tracers for understanding the early formation of the Galaxy. Recent large-scale surveys with both photometric and spectroscopic data have reported an increasing number of metal-deficient stars whose kinematic features are consistent with those of the disk stellar populations. We report the discovery of an RR Lyrae variable (hereafter RRL) that is located within the thick disk and has an orbit consistent with the thick-disk kinematics. Our target RRL (HD 331986) is located at around 1 kpc from the Sun and, with V=11.3, is among the 130 brightest RRLs known so far. However, this object was scarcely studied because it is in the midplane of the Galaxy, at a Galactic latitude of around -1 deg. Its near-infrared spectrum (0.91-1.32 micron) shows no absorption line except hydrogen lines of the Paschen series, suggesting [Fe/H] less than -2.5. It is the most metal-deficient RRL, at least, among the RRLs whose orbits are consistent with the disk kinematics, although we cannot determine to which of the disk and the halo it belongs. This unique RRL would provide us with essential clues for studying the early formation of stars in the inner Galaxy with further investigations, including high-resolution optical spectroscopy.
Submitted 14 January, 2022;
originally announced January 2022.
-
Calculation of kaon semileptonic form factor with the PACS10 configuration
Authors:
Takeshi Yamazaki,
Ken-ichi Ishikawa,
Naruhito Ishizuka,
Yoshinobu Kuramashi,
Yoshifumi Nakamura,
Yusuke Namekawa,
Yusuke Taniguchi,
Naoya Ukita,
Tomoteru Yoshié for PACS Collaboration
Abstract:
We present preliminary results for the kaon semileptonic form factors using the PACS10 configurations, whose physical volume is more than (10 fm)$^3$ at the physical point with the lattice spacings of 0.085 and 0.064 fm. The configurations were generated using the Iwasaki gauge action and $N_f=2+1$ stout-smeared Clover quark action. For the continuum extrapolation, we calculate the form factors with the local and conserved vector currents. The form factors in the two lattice spacings are extrapolated to the continuum limit using a fit function based on the NLO SU(3) ChPT formula with terms corresponding to finite lattice spacing effects. The value of $|V_{us}|$ is determined using our preliminary result of the form factor at the zero momentum transfer in the continuum limit. The result is compared with recent lattice results, and also predictions of the standard model from the unitarity of the Cabibbo-Kobayashi-Maskawa (CKM) matrix.
Submitted 1 November, 2021;
originally announced November 2021.
-
Orbital and physical parameters of eclipsing binaries from the ASAS catalogue -- XII. A sample of systems with $K2$ photometry
Authors:
K. G. Hełminiak,
A. Moharana,
T. Pawar,
N. Ukita,
P. Sybilski,
N. Espinoza,
E. Kambe,
M. Ratajczak,
A. Jordán,
H. Maehara,
R. Brahm,
S. K. Kozłowski,
M. Konacki
Abstract:
We present results of the analysis of light and radial velocity (RV) curves of eight detached eclipsing binaries observed by the All-Sky Automated Survey, which we have followed up with high-resolution spectroscopy, and were later observed by the $Kepler$ satellite as part of the $K2$ mission. The RV measurements came from spectra obtained with OAO-188/HIDES, MPG-2.2m/FEROS, SMARTS 1.5m/CHIRON, Euler/CORALIE, ESO-3.6m/HARPS, and OHP-1.93/ELODIE instruments. The $K2$ time-series photometry was analyzed with the JKTEBOP code, with out-of-eclipse modulations of different origin taken into account. Individual component spectra were retrieved with the FD3 code, and analyzed with the code iSpec in order to determine effective temperatures and metallicities. Absolute values of masses, radii, and other stellar parameters are calculated, as well as ages, found through isochrone fitting. For five systems such analysis has been done for the first time. The presented sample consists of a variety of stars, from low-mass dwarfs, through G and F-type Main Sequence objects, to evolved active sub-giants, one of which is found to be crossing the Hertzsprung gap. One target may contain a $γ$ Dor-type pulsator, two more are parts of higher-order multiples, and spectra of their tertiaries were also retrieved and used to constrain the properties of these systems.
Submitted 12 October, 2021;
originally announced October 2021.
-
NTIRE 2021 Challenge on Burst Super-Resolution: Methods and Results
Authors:
Goutam Bhat,
Martin Danelljan,
Radu Timofte,
Kazutoshi Akita,
Wooyeong Cho,
Haoqiang Fan,
Lanpeng Jia,
Daeshik Kim,
Bruno Lecouat,
Youwei Li,
Shuaicheng Liu,
Ziluan Liu,
Ziwei Luo,
Takahiro Maeda,
Julien Mairal,
Christian Micheloni,
Xuan Mo,
Takeru Oba,
Pavel Ostyakov,
Jean Ponce,
Sanghyeok Son,
Jian Sun,
Norimichi Ukita,
Rao Muhammad Umer,
Youliang Yan
, et al. (3 additional authors not shown)
Abstract:
This paper reviews the NTIRE2021 challenge on burst super-resolution. Given a RAW noisy burst as input, the task in the challenge was to generate a clean RGB image with 4 times higher resolution. The challenge contained two tracks: Track 1 evaluating on synthetically generated data, and Track 2 using real-world bursts from a mobile camera. In the final testing phase, 6 teams submitted results using a diverse set of solutions. The top-performing methods set a new state-of-the-art for the burst super-resolution task.
Submitted 7 June, 2021;
originally announced June 2021.
-
Eclipsing Binary Populations across the Northern Galactic Plane from the KISOGP survey
Authors:
Fangzhou Ren,
Richard de Grijs,
Huawei Zhang,
Licai Deng,
Xiaodian Chen,
Noriyuki Matsunaga,
Chao Liu,
Weijia Sun,
Hiroyuki Maehara,
Nobuharu Ukita,
Naoto Kobayashi
Abstract:
We present a catalog of eclipsing binaries in the northern Galactic Plane from the Kiso Wide-Field Camera Intensive Survey of the Galactic Plane (KISOGP). We visually identified 7055 eclipsing binaries spread across $\sim$330 square degrees, including 4197 W Ursa Majoris/EW-, 1458 $β$ Lyrae/EB-, and 1400 Algol/EA-type eclipsing binaries. For all systems, $I$-band light curves were used to obtain accurate system parameters. We derived the distances and extinction values for the EW-type objects from their period--luminosity relation. We also obtained the structure of the thin disk from the distribution of our sample of eclipsing binary systems, combined with those of high-mass star-forming regions and Cepheid tracers. We found that the thin disk is inhomogeneous in number density as a function of Galactic longitude. Using this new set of distance tracers, we constrain the detailed structure of the thin disk. Finally, we report a global parallax zero-point offset of $ Δπ=-42.1\pm1.9\mbox{(stat.)}\pm12.9\mbox{(syst.)}$ $μ$as between our carefully calibrated EW-type eclipsing binary positions and those provided by Gaia Early Data Release 3. Implementation of the officially recommended parallax zero-point correction results in a significantly reduced offset. Additionally, we provide a photometric characterization of our EW-type eclipsing binaries that can be applied to further analyses.
Submitted 3 February, 2021;
originally announced February 2021.
-
AIM 2020 Challenge on Video Extreme Super-Resolution: Methods and Results
Authors:
Dario Fuoli,
Zhiwu Huang,
Shuhang Gu,
Radu Timofte,
Arnau Raventos,
Aryan Esfandiari,
Salah Karout,
Xuan Xu,
Xin Li,
Xin Xiong,
Jinge Wang,
Pablo Navarrete Michelini,
Wenhao Zhang,
Dongyang Zhang,
Hanwei Zhu,
Dan Xia,
Haoyu Chen,
Jinjin Gu,
Zhi Zhang,
Tongtong Zhao,
Shanshan Zhao,
Kazutoshi Akita,
Norimichi Ukita,
Hrishikesh P S,
Densen Puthussery
, et al. (1 additional authors not shown)
Abstract:
This paper reviews the video extreme super-resolution challenge associated with the AIM 2020 workshop at ECCV 2020. Common scaling factors for learned video super-resolution (VSR) do not go beyond factor 4. Missing information can be restored well in this region, especially in HR videos, where the high-frequency content mostly consists of texture details. The task in this challenge is to upscale videos with an extreme factor of 16, which results in more serious degradations that also affect the structural integrity of the videos. A single pixel in the low-resolution (LR) domain corresponds to 256 pixels in the high-resolution (HR) domain. Due to this massive information loss, it is hard to accurately restore the missing information. Track 1 is set up to gauge the state-of-the-art for such a demanding task, where fidelity to the ground truth is measured by PSNR and SSIM. Perceptually higher quality can be achieved in trade-off for fidelity by generating plausible high-frequency content. Track 2 therefore aims at generating visually pleasing results, which are ranked according to human perception, evaluated by a user study. In contrast to single image super-resolution (SISR), VSR can benefit from additional information in the temporal domain. However, this also imposes an additional requirement, as the generated frames need to be consistent along time.
Submitted 14 September, 2020;
originally announced September 2020.
-
Image Super-Resolution using Explicit Perceptual Loss
Authors:
Tomoki Yoshida,
Kazutoshi Akita,
Muhammad Haris,
Norimichi Ukita
Abstract:
This paper proposes an explicit way to optimize the super-resolution network for generating visually pleasing images. Previous approaches use several loss functions that are hard to interpret and are only implicitly related to improving the perceptual score. We show how to exploit a machine-learning-based model that is directly trained to provide the perceptual score on generated images. It is believed that these models can be used to optimize the super-resolution network in a way that is easier to interpret. We further analyze the characteristics of the existing loss and our proposed explicit perceptual loss for better interpretation. The experimental results show that the explicit approach has a higher perceptual score than other approaches. Finally, we demonstrate the relation between the explicit perceptual loss and visually pleasing images using subjective evaluation.
Submitted 1 September, 2020;
originally announced September 2020.
-
NTIRE 2020 Challenge on Perceptual Extreme Super-Resolution: Methods and Results
Authors:
Kai Zhang,
Shuhang Gu,
Radu Timofte,
Taizhang Shang,
Qiuju Dai,
Shengchen Zhu,
Tong Yang,
Yandong Guo,
Younghyun Jo,
Sejong Yang,
Seon Joo Kim,
Lin Zha,
Jiande Jiang,
Xinbo Gao,
Wen Lu,
Jing Liu,
Kwangjin Yoon,
Taegyun Jeon,
Kazutoshi Akita,
Takeru Ooba,
Norimichi Ukita,
Zhipeng Luo,
Yuehan Yao,
Zhenyu Xu,
Dongliang He
, et al. (38 additional authors not shown)
Abstract:
This paper reviews the NTIRE 2020 challenge on perceptual extreme super-resolution with focus on proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor of 16 based on a set of prior examples of low and corresponding high resolution images. The goal is to obtain a network design capable of producing high resolution results with the best perceptual quality and similar to the ground truth. The track had 280 registered participants, and 19 teams submitted the final results. They gauge the state-of-the-art in single image super-resolution.
Submitted 3 May, 2020;
originally announced May 2020.
-
Space-Time-Aware Multi-Resolution Video Enhancement
Authors:
Muhammad Haris,
Greg Shakhnarovich,
Norimichi Ukita
Abstract:
We consider the problem of space-time super-resolution (ST-SR): increasing spatial resolution of video frames and simultaneously interpolating frames to increase the frame rate. Modern approaches handle these axes one at a time. In contrast, our proposed model called STARnet super-resolves jointly in space and time. This allows us to leverage mutually informative relationships between time and space: higher resolution can provide more detailed information about motion, and higher frame-rate can provide better pixel alignment. The components of our model that generate latent low- and high-resolution representations during ST-SR can be used to finetune a specialized mechanism for just spatial or just temporal super-resolution. Experimental results demonstrate that STARnet improves the performances of space-time, spatial, and temporal video super-resolution by substantial margins on publicly available datasets.
Submitted 29 March, 2020;
originally announced March 2020.
-
$K_{l3}$ form factors in $N_f = 2+1$ QCD at physical point on large volume
Authors:
Junpei Kakazu,
Ken-ichi Ishikawa,
Naruhito Ishizuka,
Yoshinobu Kuramashi,
Yoshifumi Nakamura,
Yusuke Namekawa,
Yusuke Taniguchi,
Naoya Ukita,
Takeshi Yamazaki,
Tomoteru Yoshié
Abstract:
We present our results of the $K_{l3}$ form factors on the volume whose spatial extent is more than $L=$10 fm, with the physical pion and kaon masses using the stout-smearing clover $N_f = 2+1$ quark action and Iwasaki gauge action at $a^{-1}\approx2.3$ GeV. The $K_{l3}$ form factor at zero momentum transfer is obtained from fit based on the next-to-leading (NLO) formula in SU(3) chiral perturbation theory. We estimate systematic errors of the form factor, mainly coming from the finite lattice spacing effect. We also determine the value of $|V_{us}|$ by combining our result with the experiment and check the consistency with the standard model prediction. The result is compared with the previous lattice calculations.
Submitted 5 February, 2020;
originally announced February 2020.
-
Gradient flow equation in SQCD
Authors:
Daisuke Kadoh,
Naoya Ukita
Abstract:
We propose a supersymmetric gradient flow in ${\cal N}=1$ SQCD in four dimensions. The flow equation is derived in the superfield formalism and is also given for component fields of the Wess-Zumino gauge in a gauge covariant manner. We find that the flow for the component fields is supersymmetric in the sense that the flow time derivative and any supersymmetry transformation commute with each other up to a gauge transformation.
Submitted 31 December, 2019;
originally announced December 2019.
-
$K_{l3}$ form factors at the physical point on (10.9 fm)$^3$ volume
Authors:
Junpei Kakazu,
Ken-ichi Ishikawa,
Naruhito Ishizuka,
Yoshinobu Kuramashi,
Yoshifumi Nakamura,
Yusuke Namekawa,
Yusuke Taniguchi,
Naoya Ukita,
Takeshi Yamazaki,
Tomoteru Yoshié
Abstract:
We present the calculation of the $K_{l3}$ form factors with $N_f = 2 + 1$ nonperturbatively $O(a)$-improved Wilson quark action and Iwasaki gauge action at the physical point on a large volume of (10.9 fm)$^3$ at one lattice spacing of $a = 0.085$ fm. We extract the form factors from 3-point functions with three different time separations between the source and sink operators to confirm suppression of excited state contributions. Thanks to the large volume, the form factors are calculated very close to zero momentum transfer, $q^2 = 0$, so that stable interpolations to $q^2 = 0$ are carried out. Using our form factors, we obtain the form factor at $q^2 = 0$, $f_+(0) = 0.9603(16)(^{+14}_{\ -4})(44)(19)(1)$, where the first, second, and fifth errors are statistical, systematic errors from fit functions and the isospin breaking effect, respectively. The third and fourth errors denote the finite lattice spacing effects estimated from the renormalization factor and contribution beyond the leading order SU(3) chiral perturbation theory (ChPT). The result of $f_+(0)$ yields the Cabibbo-Kobayashi-Maskawa (CKM) matrix element, $|V_{us}| = 0.2255(13)(4)$, where the first error comes from our calculation and the second from the experiment. This value is consistent with the ones determined from the unitarity of the CKM matrix and the $K_{l2}$ decay within one standard deviation, while it is slightly larger than recent lattice calculations by at most $1.5\sigma$. Furthermore, we evaluate the shape of the form factors and the phase space integral from our results. We confirm that those results are consistent with the experiment, and also that $|V_{us}|$ determined with our phase space integral agrees with the one above.
Submitted 13 May, 2020; v1 submitted 30 December, 2019;
originally announced December 2019.
-
Finite size effect on vector meson and baryon sectors in 2+1 flavor QCD at the physical point
Authors:
K. -I. Ishikawa,
N. Ishizuka,
Y. Kuramashi,
Y. Nakamura,
Y. Namekawa,
E. Shintani,
Y. Taniguchi,
N. Ukita,
T. Yamazaki,
T. Yoshie
Abstract:
We investigate the finite size effect on the vector meson and the baryon sectors using a subset of the "PACS10" configurations, which are generated keeping the space-time volumes over (10 fm)$^4$ in 2+1 flavor QCD at the physical point. Comparing the results on (5.5 fm)$^4$ and (10.9 fm)$^4$ lattices, the ground states of octet baryons, which are stable on the lattice, show no finite size effect within less than 0.5% level of statistical errors. For those of vector mesons, which are unstable on the lattice, we observe that the effective masses are well below the experimental resonance levels both on (5.5 fm)$^4$ and (10.9 fm)$^4$ lattices. For the decuplet baryon sector we have found that the time dependence of the effective mass looks quite similar to that for the vector meson sector including the $Ω$ baryon channel. We discuss its origin due to a possible mixing with the nearby multihadron states. Since the $Ξ$ baryon mass can be determined with the smallest ambiguity among the vector meson and the baryon masses, we use it together with the pion and kaon masses as the physical inputs to determine the physical point.
Submitted 11 November, 2019; v1 submitted 25 July, 2019;
originally announced July 2019.
-
Semi- and Weakly-supervised Human Pose Estimation
Authors:
Norimichi Ukita,
Yusuke Uematsu
Abstract:
For human pose estimation in still images, this paper proposes three semi- and weakly-supervised learning schemes. While recent advances in convolutional neural networks have improved human pose estimation using supervised training data, our focus is to explore the semi- and weakly-supervised schemes. Our proposed schemes initially learn conventional model(s) for pose estimation from a small amount of standard training images with human pose annotations. For the first semi-supervised learning scheme, this conventional pose model detects candidate poses in training images with no human annotation. From these candidate poses, only true-positives are selected by a classifier using a pose feature representing the configuration of all body parts. The accuracies of this candidate pose estimation and true-positive pose selection are improved by action labels provided to these images in our second and third learning schemes, which are semi- and weakly-supervised learning. While the first and second learning schemes select only poses that are similar to those in the supervised training data, the third scheme selects more true-positive poses that are significantly different from any supervised poses. This pose selection is achieved by pose clustering using outlier pose detection with Dirichlet process mixtures and the Bayes factor. The proposed schemes are validated with large-scale human pose datasets.
Submitted 4 June, 2019;
originally announced June 2019.
-
Supersymmetric gradient flow in the Wess-Zumino model
Authors:
Daisuke Kadoh,
Kengo Kikuchi,
Naoya Ukita
Abstract:
We propose a supersymmetric gradient flow equation in the four-dimensional Wess-Zumino model. The flow is constructed in two ways. One is based on the off-shell component fields and the other is based on the superfield formalism; both give the same result. The obtained flow is supersymmetric because the flow time derivative and the supersymmetry transformation commute with each other. Solving the equation, we find that it has a damped oscillation with the flow time for nonzero mass, which is different from the Yang-Mills flow. The on-shell flow equation is also discussed.
Submitted 5 August, 2019; v1 submitted 13 April, 2019;
originally announced April 2019.
-
Deep Back-Projection Networks for Single Image Super-resolution
Authors:
Muhammad Haris,
Greg Shakhnarovich,
Norimichi Ukita
Abstract:
Previous feed-forward architectures of recently proposed deep super-resolution networks learn the features of low-resolution inputs and the non-linear mapping from those to a high-resolution output. However, this approach does not fully address the mutual dependencies of low- and high-resolution images. We propose Deep Back-Projection Networks (DBPN), the winner of two image super-resolution challenges (NTIRE2018 and PIRM2018), that exploit iterative up- and down-sampling layers. These layers are formed as a unit providing an error feedback mechanism for projection errors. We construct mutually-connected up- and down-sampling units each of which represents different types of low- and high-resolution components. We also show that extending this idea yields new insights toward more efficient network design, such as parameter sharing on the projection module and a transition layer on the projection step. The experiments yield superior results and, in particular, establish new state-of-the-art results across multiple data sets, especially for large scaling factors such as 8x.
Submitted 12 June, 2020; v1 submitted 4 April, 2019;
originally announced April 2019.
-
Recurrent Back-Projection Network for Video Super-Resolution
Authors:
Muhammad Haris,
Greg Shakhnarovich,
Norimichi Ukita
Abstract:
We propose a novel architecture for the problem of video super-resolution. We integrate spatial and temporal contexts from continuous video frames using a recurrent encoder-decoder module that fuses multi-frame information with the more traditional, single frame super-resolution path for the target frame. In contrast to most prior work where frames are pooled together by stacking or warping, our model, the Recurrent Back-Projection Network (RBPN), treats each context frame as a separate source of information. These sources are combined in an iterative refinement framework inspired by the idea of back-projection in multiple-image super-resolution. This is aided by explicitly representing estimated inter-frame motion with respect to the target, rather than explicitly aligning frames. We propose a new video super-resolution benchmark, allowing evaluation at a larger scale and considering videos in different motion regimes. Experimental results demonstrate that our RBPN is superior to existing methods on several datasets.
Submitted 25 March, 2019;
originally announced March 2019.
-
Human Pose Estimation using Motion Priors and Ensemble Models
Authors:
Norimichi Ukita
Abstract:
Human pose estimation in images and videos is one of the key technologies for realizing a variety of human activity recognition tasks (e.g., human-computer interaction, gesture recognition, surveillance, and video summarization). This paper presents two types of human pose estimation methodologies: 1) 3D human pose tracking using motion priors and 2) 2D human pose estimation with ensemble modeling.
Submitted 25 January, 2019;
originally announced January 2019.
-
HIDES spectroscopy of bright detached eclipsing binaries from the $Kepler$ field - III. Spectral analysis, updated parameters, and new systems
Authors:
K. G. Hełminiak,
M. Konacki,
H. Maehara,
E. Kambe,
N. Ukita,
M. Ratajczak,
A. Pigulski,
S. K. Kozłowski
Abstract:
We present the latest results of our spectroscopic observations and refined modelling of a sample of detached eclipsing binaries (DEBs), selected from the $Kepler$ Eclipsing Binary Catalog, that are also double-lined spectroscopic binaries (SB2). New high resolution spectra obtained with the HIDES spectrograph, attached to the 1.88-m telescope of the Okayama Astrophysical Observatory, supplemented the previous observations, allowing us to improve the physical parameters of some systems and to characterize three previously omitted ones. All the obtained radial velocities (RVs) were combined with $Kepler$ photometry, in order to derive a full set of orbital and physical parameters.
Ten out of eleven SB2s have their component spectra tomographically disentangled, and spectral analysis was performed with iSpec, in order to derive effective temperatures of components and metallicities of the systems. By comparing our results with theoretical models, we assess the age and evolutionary status of the studied objects. We find a good match for all but one system. We have derived distances from the parameters thus determined, and compare them with those from the $Gaia$ Data Release 2. For systems previously studied by other authors, our new results lead to better consistency between observations and models.
Submitted 2 January, 2019;
originally announced January 2019.
-
Orbital and physical parameters of eclipsing binaries from the All-Sky Automated Survey catalogue - X. Three high-contrast systems with secondaries detected with IR spectroscopy
Authors:
K. G. Hełminiak,
A. Tokovinin,
E. Niemczura,
R. Pawłaszek,
K. Yanagisawa,
R. Brahm,
N. Espinoza,
N. Ukita,
E. Kambe,
M. Ratajczak,
M. Hempel,
A. Jordán,
M. Konacki,
P. Sybilski,
S. K. Kozłowski,
M. Litwickim,
M. Tamura
Abstract:
We present results of the combined photometric and spectroscopic analysis of three detached eclipsing binaries whose secondary components are not visible or very hard to identify in the optical spectra: ASAS J052743-0359.7, ASAS J065134-2211.5, and ASAS J073507-0905.7. The first one is a known visual binary, ADS 4022, and we found that it is a quadruple system composed of two spectroscopic binaries, one of which shows eclipses. None of the systems was previously recognized as a spectroscopic binary.
We collected a number of high-resolution optical and IR spectra to calculate the radial velocities (RVs) and later combined them with MITSuME and ASAS photometry. The IR spectra were crucial for secure identification of the cooler components' lines. RV measurements were done with the TODCOR technique, and RV curves modelled with our own procedure V2FIT. Light curve modelling was performed with JKTEBOP and PHOEBE codes. Temperatures and metallicities of two systems were estimated from spectra. For the ADS 4022 system we also used the archival WDS data and new SOAR observations in order to derive the orbit of the visual pair for the first time. Ages were estimated by comparing our results with PARSEC isochrones.
The eclipsing pair A052743 A ($P=5.27$ d) is composed of a 1.03(6) M$_\odot$, 1.03(2) R$_\odot$ primary and a 0.60(2) M$_\odot$, 0.59(2) R$_\odot$ secondary. The components of the $P=21.57$ d non-eclipsing pair B likely have masses in between the two eclipsing components, and both pairs are on a $\sim$188 yr orbit around their common centre of mass. The system A065134 ($P=8.22$ d) consists of a 0.956(12) M$_\odot$, 0.997(4) R$_\odot$ primary and a 0.674(5) M$_\odot$, 0.690(7) R$_\odot$ secondary. Finally, A073507 ($P=1.45$ d), which consists of a 1.452(34) M$_\odot$, 1.635(12) R$_\odot$ primary and a 0.808(13) M$_\odot$, 0.819(11) R$_\odot$ secondary, is likely a PMS system.
Submitted 11 December, 2018;
originally announced December 2018.
-
Supersymmetric gradient flow in N=1 SYM
Authors:
Daisuke Kadoh,
Naoya Ukita
Abstract:
The gradient flow equation is derived in four-dimensional N=1 supersymmetric Yang-Mills theory in terms of the component fields of the Wess-Zumino gauge. We show that the flow-time derivative and the supersymmetry transformation that is naively extended to 4+1 dimensions by replacing the four-dimensional fields with the corresponding flowed fields commute with each other up to a gauge transformation. In this sense, the obtained flow is supersymmetric in the Wess-Zumino gauge. We also discuss the symmetry of the flow equation further.
Submitted 23 November, 2022; v1 submitted 6 December, 2018;
originally announced December 2018.
-
Finite size effect on pseudoscalar meson sector in 2+1 flavor QCD at the physical point
Authors:
K. -I. Ishikawa,
N. Ishizuka,
Y. Kuramashi,
Y. Nakamura,
Y. Namekawa,
Y. Taniguchi,
N. Ukita,
T. Yamazaki,
T. Yoshie
Abstract:
We investigate the finite size effect on pseudoscalar meson masses and decay constants using a subset of the "PACS10" configurations, which are generated keeping the space-time volumes over (10 fm)$^4$ in 2+1 flavor QCD at the physical point. We have tried two kinds of analyses, fixing $κ$ values or measured axial Ward identity quark masses. Comparing the results on (5.4 fm)$^4$ and (10.8 fm)$^4$ lattices, we have found a sizable finite size effect on the pseudoscalar meson sector in the former analysis: a 2.1(8)%, 4.8(1.6)%, and 0.36(31)% finite size effect on $m_π$, $m_{\rm ud}$, and $f_π$, respectively, on the (5.4 fm)$^4$ lattice. For the latter analysis, the finite size effect on the pseudoscalar meson decay constants is 0.66(33)% for $f_π$, 0.26(13)% for $f_K$, and 0.40(32)% for $f_K/f_π$. These values with two-sigma error bars are consistent with the predictions from the full one-loop SU(3) chiral perturbation theory, which are 0.20% for $f_π$, 0.08% for $f_K$, and 0.13% for $f_K/f_π$. The finite size effect on the pseudoscalar meson masses is hardly detected under the current statistical precision.
Submitted 9 January, 2019; v1 submitted 17 July, 2018;
originally announced July 2018.
-
Utility of geometry in lattice QCD simulations
Authors:
Naoya Ukita,
Ken-Ichi Ishikawa,
Yoshinobu Kuramashi
Abstract:
We propose a way to improve the resolution of the spatial momentum and the time interval for hadron propagators utilizing the lattice geometry. We demonstrate the validity of the method presenting results for pseudoscalar meson energies with and without finite momenta in a large-scale quenched QCD simulation. The method should be useful especially for master-field simulations.
Submitted 2 August, 2018; v1 submitted 28 May, 2018;
originally announced May 2018.
-
Task-Driven Super Resolution: Object Detection in Low-resolution Images
Authors:
Muhammad Haris,
Greg Shakhnarovich,
Norimichi Ukita
Abstract:
We consider how image super resolution (SR) can contribute to an object detection task in low-resolution images. Intuitively, SR gives a positive impact on the object detection task. While several previous works demonstrated that this intuition is correct, SR and detector are optimized independently in these works. This paper proposes a novel framework to train a deep neural network where the SR sub-network explicitly incorporates a detection loss in its training objective, via a tradeoff with a traditional detection loss. This end-to-end training procedure allows us to train SR preprocessing for any differentiable detector. We demonstrate that our task-driven SR consistently and significantly improves accuracy of an object detector on low-resolution images for a variety of conditions and scaling factors.
Submitted 29 March, 2018;
originally announced March 2018.
-
Deep Back-Projection Networks For Super-Resolution
Authors:
Muhammad Haris,
Greg Shakhnarovich,
Norimichi Ukita
Abstract:
The feed-forward architectures of recently proposed deep super-resolution networks learn representations of low-resolution inputs, and the non-linear mapping from those to high-resolution output. However, this approach does not fully address the mutual dependencies of low- and high-resolution images. We propose Deep Back-Projection Networks (DBPN), that exploit iterative up- and down-sampling layers, providing an error feedback mechanism for projection errors at each stage. We construct mutually-connected up- and down-sampling stages each of which represents different types of image degradation and high-resolution components. We show that extending this idea to allow concatenation of features across up- and down-sampling stages (Dense DBPN) allows us to further improve super-resolution, yielding superior results and in particular establishing new state-of-the-art results for large scaling factors such as 8x across multiple data sets.
Submitted 7 March, 2018;
originally announced March 2018.