VK's AI researchers have released the VK-LSVD (Large Short-Video Dataset) dataset to the public. According to the press service, engineers and scientists will be able to develop and improve recommendation algorithms with its help, in order to make services and products more personalized.
The dataset is available on Hugging Face and includes 40 billion anonymized unique interactions of 10 million users with 20 million short videos over six months (January-June 2025), including "aggregated likes, dislikes, shares, watch time, and playback context."
All data is presented in the form of numerical identifiers, which ensures confidentiality. An embedding (numerical description of the content) is provided for each video, and socio-demographic characteristics are provided for each user. VK explained:
Short videos are a unique format for recommendation algorithms. Unlike music, podcasts, or long videos, they cannot be 'consumed' in the background, and each video shown receives some reaction from the user. Even if the user does not leave a like, skipping or watching the video is already considered feedback.
Now on home
Герой России Гарнаев: никто из профессионалов о возобновлении производства на КАЗ всерьёз не говорит
Система отслеживает спутники на высотах до 50 000 км и ведёт за ними наблюдение
The armored vehicle is equipped with a KamAZ-740.35-400 diesel engine with a power of 400 hp.
Constant improvements in avionics, weapons and tactical capabilities will make the aircraft a flexible response to future challenges
The exterior of the KamAZ-54901 features fairings on the cab and chassis for fuel economy
Fighters are in demand both domestically and abroad
Tyazhpromexport and Venezuela Agree on Plant Revival
The company not only completed the state order, but also quickly mastered the production of AK-12K for special forces
Experts have developed a photogrammetric complex with a resolution of less than 1 cm