TikTok interview question

How do you do video clustering algorithm (unsupervised learning scenario)