Skip to content

DataForScience/LLM4DS

Repository files navigation

GitHub Twitter @data4sci GitHub top language GitHub repo size GitHub last commit

Graphs For Science Data Science Briefing

Large Language Models for Data Science

Code and slides to accompany the online series of webinars: https://data4sci.com/llm4ds by Data For Science.

Large Language Models (LLMs) are powerful tools that put state-of-the-art AI capabilities at the tip of our fingers. They can process large amounts of data, understand nuance and context, and perform complex tasks at our request. Over the past few years, LLMs have multiplied as have the tools specially built to leverage their capabilities.

In this course, you will learn how to use large language models to perform data science tasks such as summarization, translation, named entity recognition, audio generation, and data processing. We’ll explore the possibilities afforded by the tools and APIs developed by OpenAI, Hugging Face, LangChain, and Pandas AI and how best to apply them to our data science work.

Schedule

1.Generative AI for Data Science

  • Generative AI
  • Large Language Models
  • OpenAI
  • HuggingFace
  • LangChain

2. Prompt Engineering for Data Science

  • Output formatting
  • Prompt Techniques
  • Zero-Shot and Few-Shot Prompting
  • Chain of Thought

3. Natural Language Processing with HuggingFace

  • Named-Entity Recognition
  • Part-of-Speech Tagging
  • Summarization
  • Question Answering

4.Text to Speech with Open AI

  • The Whisper model
  • Generatinag audio from text
  • Audio transcription
  • Automatic Translation

5.Pandas AI

  • Pandas AI library structure
  • Natural language querying
  • Data cleaning
  • Data visualization

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published