Learning Cross-modal Embeddings for Cooking Recipes and Food Images


Staying with the food theme, researchers from MIT have created a dataset of over 1M cooking recipes and 800k food images. They propose an image-recipe retrieval task and train a neural network that embeds recipes and images in a shared space, so that a matching recipe can be found for a given image. An online demo is also available. Rather than only using the model to retrieve an existing recipe, I would have loved to see them generate a recipe from an image, for instance using the neural checklist model (Kiddon et al., 2016).
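
To make the retrieval idea concrete, here is a minimal sketch of nearest-neighbour lookup in a shared embedding space. It is not the paper's model: the vectors are hand-made toy values, and the names `cosine`, `retrieve`, `recipe_vecs`, and `image_vec` are hypothetical. In a real system, trained image and recipe encoders would produce the embeddings.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def retrieve(image_vec, recipe_vecs, k=1):
    """Return the names of the k recipes most similar to the image embedding."""
    ranked = sorted(
        recipe_vecs,
        key=lambda name: cosine(image_vec, recipe_vecs[name]),
        reverse=True,
    )
    return ranked[:k]


# Toy embeddings (assumption: stand-ins for the output of a trained recipe encoder).
recipe_vecs = {
    "margherita pizza": [0.9, 0.1, 0.0],
    "chocolate cake": [0.0, 0.2, 0.9],
    "caesar salad": [0.1, 0.9, 0.1],
}

# Assumption: stand-in for the image encoder's output on a photo of a pizza.
image_vec = [0.8, 0.2, 0.1]

top = retrieve(image_vec, recipe_vecs, k=2)
print(top)
```

With these toy values the pizza recipe ranks first. Cosine similarity is used here only as a common choice for comparing embeddings; the blurb above does not say which similarity measure the authors use.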

