A mobile device framework for video captioning using multimodal neural networks