Using a frame from the middle of the video is usually a good way to get a representative image.
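For what it's worth, here is a rough sketch of how that could be done, assuming ffmpeg is installed and you already know the video's duration in seconds. The function name `middle_frame_command` and the default output filename are made up for illustration; only the ffmpeg flags (`-ss`, `-i`, `-frames:v`) are real.

```python
def middle_frame_command(video_path, duration_seconds, output_path="thumbnail.jpg"):
    """Build an ffmpeg command that grabs one frame at the video's midpoint.

    Hypothetical helper: it only builds the argument list, it does not run ffmpeg.
    """
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    midpoint = duration_seconds / 2
    return [
        "ffmpeg",
        "-ss", f"{midpoint:.3f}",  # seek to the midpoint, in seconds
        "-i", video_path,          # input video
        "-frames:v", "1",          # write a single video frame
        output_path,
    ]
```

You could pass the resulting list to `subprocess.run(cmd, check=True)`. Whether the midpoint frame actually looks representative will depend on the video.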
I've yet to hear anybody here explain the benefit of disguising clickbait videos. It just means you lose any way of spotting low-quality videos. It's like putting lipstick on a pig. I'd rather avoid the pig.