Works
Posts
7
Course
Talks
Goals
Exploration
Freelance
CV
About
Archive
Misc
📝 Posts
👤 About
🏠 Home
📕 Travel handbook
📄 Publications
⚙️ Software
🎵 Music
📝 Posts
🎓 Course
💬 Talks
🏆 Goals
🌏 Exploration
🤝 Freelance
💼 CV
🖼️ Gallery
📜 Stories
🧠 Zettelkasten
🔗 Saved articles
🔗 Saved sites
🔗 Saved videos
📲 Feed
📁 Playlists
💽 Top albums
🎞️ Top films
📚 Bookshelf
⭐ Hall of fame
🛠️ Gear & tools
💪 Calisthenics
👥 Friend blogs
🌱 Habits
❓ FAQ
👤 About
🎁 Donate
👋 Contact
Intro to NLP
Everything around is woven from words
#️⃣
⌛ ~1 h
🗿 Beginner
11.08.2023
upd:
#66
Intro to NLP
Everything around is woven from words
⌛ ~1 h
#66
Definition of NLP
Importance in data science
Historical milestones
Main NLP tasks
Core definitions (e.g., corpora)
NLP fundamentals: text preprocessing and morphological analysis
Text preprocessing steps
Tokenization
Stop word removal
Stemming and lemmatization
Morphological analysis
POS tagging
Deduplication
Word embeddings & factor analysis
Feature extraction techniques
Factor analysis for text data
Word embeddings
Advanced transformations: LSA, pLSA, and GLSA
Latent semantic analysis (LSA)
Probabilistic LSA (pLSA)
GLSA techniques
Relationship to factor analysis
Model architectures: Seq2Seq, attention, and positional encoding
Exploring seq2seq architecture
The role of attention in NLP
Positional encoding for sequence models
Transformers
Core NLP tasks
Text classification & sentiment analysis
Named entity recognition (NER)
Machine translation
Question answering (QA)
Text mining approaches
Emotion analysis
Evaluating NLP systems
Common NLP metrics
Best practices for model evaluation
Multi-class and multi-label tasks
Advanced breakthroughs & conclusion
The CLIP model
Future directions in NLP
Libraries & tools
Summary of key points
Switch to wide layout