Case Study

project description

  • In this project, we created an AI model which can generate text or caption the given image.

  • For this project, we have used an encoder-decoder transformer, so the encoder is a Convolution Neural Network that will process the input image and create a middle output which will be passed to the decoder with a Recurrent Neural Network for generating the text from the middle output of the encoder.

Technologies used

  • We have used Python, PyTorch, and OpenCV to create this project.

  • We have built the encoder-decoder transformer model.

Difficulties we faced

  • No major difficulties were faced during this project.

Solutions

  • No major difficulties were faced during this project.

Computer Vision + NLP

6 months

Share now

Share on facebook
Facebook
Share on twitter
Twitter
Share on linkedin
LinkedIn
Share on pinterest
Pinterest
Share on whatsapp
WhatsApp
Share on email
Email

gallery

project images

ready to get started?

Receive news, announcement and reports