If you are looking for an interesting paper on this topic, the seminal work is likely:
: The site frequently produces its own original titles, alongside hosting content from similar niche providers like Ullu.
However, if you are referring to the dataset specifically, here is an overview of why this is an interesting topic and what the key papers focus on: