Data release for the ImageInWords (IIW) paper.
-
Updated
May 25, 2024 - JavaScript
Data release for the ImageInWords (IIW) paper.
DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception
Content-Based Image Retrieval System
This repo represents our machine learning project Image Description which is used to generate a description of an image based on activities and objects detected in the image.
In this project, we use a Deep Recurrent Architecture, which uses CNN (VGG-16 Net) pretrained on ImageNet to extract 4096-Dimensional image feature Vector and an LSTM which generates a caption from these feature vectors.
we generate captions to the images which are given by user(user input) using prompt engineering and Generative AI
Integrate AI capabilities into a DevExpress-powered Office File API Web API application.
NL Generation from structured inputs. Focuses on generating natural language descriptions for images by exploring the relationship between textual descriptions and image attributes. Leveraging an encoder-decoder architecture with LSTM cells, the system transforms normalized vector representations of attributes into fixed-length vector.
Trabalho de Conclusão de Curso de Engenharia de Computação (UTFPR): Descritor de imagem baseado em curvas de Hilbert
Testing the Moondream tiny vision model
Key Pointers/ Exhaustive Notes for various Machine Learning Research Papers
Lucene Image Retrieval (LIRe) code to extract Open Access Series of Imaging Studies (OASIS) features.
Add a description, image, and links to the image-descriptions topic page so that developers can more easily learn about it.
To associate your repository with the image-descriptions topic, visit your repo's landing page and select "manage topics."