GitHunt
KA

kacky24/stylenet

A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"

StyleNet: Generating Attractive Visual Captions with Styles

* under development

StyleNet is a novel framework to address the task of generating attractive captions for images and videos with different styles. A novel model component, named factored LSTM is used in StyleNet, which automatically distills the style factors in the monolingual text corpus.

framework
Imgur

examples of generated captions
Imgur

Description

Requirement

  • python 3.5.3
  • pytorch 0.2.0
  • torchvision 0.1.9
  • numpy 1.13.3
  • scikit-image 0.13.1
  • nltk 3.2.5

Languages

Python100.0%

Contributors

Created November 8, 2017
Updated August 7, 2025
kacky24/stylenet | GitHunt