Creating dataset for tamil language

Hi Guys

Posting nearly after a year!

I have developed a project using python, which extracts data from theekkathir.in newspaper to hugging Face datasets suitable for training or fine-tuning LLMs in Tamil.

Link to GitHub:-

Link to Hugging Face :hugs::-