Hello,

I want to create an AI model to learn about AI/ML. so I have scraped some data from Threads and Instagram.now I am wondering how can I use this dataset to make an AI model or do something useful with it? (BTW I don’t know anything about AI/ML. I have done internship as Data Analyst so I know a little bit about Linear regression etc. but don’t know anything advance.)

I am really curious to explore this space :)

  • andrew0@lemmy.dbzer0.com
    link
    fedilink
    arrow-up
    3
    ·
    5 months ago

    Depending on how much compute you have available, you can look into finetuning models from HuggingFace (e.g. Llama 3, or a smaller Phi model). Look into LoRA, and try to learn how the model you choose calculates the loss.

    There are various ways to train, and usually involves masking the input by replacing random input tokens with the mask token. I won’t go into too much detail with this, because it’s a lot to explain, and I suggest you read an article on this (link1 or link2)