Sleeping veureu-svision ๐ฆ Process images and videos to generate descriptions, face embeddings, and scene cuts