lemonilia/roleplaying-forums-raw
Viewer
•
Updated
•
244k
•
21
•
1
Note Raw roleplaying forum data, not directly usable for finetuning.
Note Semi-cleaned data via Python scripting, arranged in a convenient format.
Note Completely raw data, not directly usable for finetuning, scraped in early 2023. Note that this data has some issues with spaces between adjacent HTML tags.
Note Manually-curated and cleaned human RP dataset with 2k conversations in the 2k-8k tokens range and light augmentation from GPT3.5/4 data. Probably obsolete, now.