Data Exploration
Pandas
Shape
Describe
Value_Counts
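As a minimal sketch of the three exploration steps listed above, the snippet below builds a small made-up DataFrame (the `text` and `label` columns and their contents are illustrative assumptions, not data from this outline) and calls `shape`, `describe`, and `value_counts` on it.

```python
import pandas as pd

# Hypothetical toy dataset; column names and rows are illustrative only.
df = pd.DataFrame({
    "text": ["great movie", "terrible plot", "great cast", "average film"],
    "label": ["pos", "neg", "pos", "neutral"],
})

# Shape: a (rows, columns) tuple.
n_rows, n_cols = df.shape

# Describe: summary statistics; include="all" also covers text columns.
summary = df.describe(include="all")

# Value counts: how often each class label occurs.
label_counts = df["label"].value_counts()

print(n_rows, n_cols)
print(summary)
print(label_counts)
```

Checking `value_counts` on the label column early is a quick way to spot class imbalance before any model is trained.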
Preprocessing (Normalization)
NLTK
Regular Expression Replacement
Removing Punctuation
Removing Stop Words
Tokenization
Stemming
Lemmatization
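The first four normalization steps above can be sketched with the Python standard library alone. This is an illustration under stated assumptions, not the NLTK workflow itself: the `normalize` function, the URL pattern, and the tiny `STOP_WORDS` set are all hypothetical stand-ins. Stemming and lemmatization are left out because they need a library such as NLTK (for example its `PorterStemmer` and `WordNetLemmatizer` classes).

```python
import re
import string

# Hypothetical, deliberately tiny stop-word list; real lists are much larger.
STOP_WORDS = {"the", "a", "an", "is", "are", "and", "of", "to", "in"}

def normalize(text):
    """Lowercase, replace URLs, strip punctuation, tokenize, drop stop words."""
    text = text.lower()
    # Regular expression replacement: swap URLs for a placeholder token.
    text = re.sub(r"https?://\S+", "URL", text)
    # Removing punctuation: delete every ASCII punctuation character.
    text = text.translate(str.maketrans("", "", string.punctuation))
    # Tokenization: a simple whitespace split.
    tokens = text.split()
    # Removing stop words.
    return [t for t in tokens if t not in STOP_WORDS]

tokens = normalize("The cats are sleeping in the garden, see https://example.com!")
print(tokens)
```

The order of the steps matters here: the URL is replaced before punctuation is stripped, otherwise the `://` and `.` characters would be removed and the pattern would no longer match.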
Vectorization
N-Grams
One-Hot Encoding
Count Vectorization
Inverse Document Frequency
TF-IDF
Word2Vec
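To make the count-based vectorization ideas above concrete, here is a small pure-Python sketch of n-grams, count vectors, and TF-IDF. The three toy documents and the helper names (`ngrams`, `idf`) are assumptions made for illustration. The IDF used is the plain textbook form log(N / df); libraries such as scikit-learn apply a smoothed variant by default, so their numbers will differ.

```python
import math
from collections import Counter

# Hypothetical, already-tokenized toy corpus.
docs = [
    ["the", "cat", "sat"],
    ["the", "dog", "sat"],
    ["the", "cat", "ran"],
]

def ngrams(tokens, n):
    """Return every contiguous run of n tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Count vectorization: one column per vocabulary term, one row per document.
vocab = sorted({term for doc in docs for term in doc})
counts = [[Counter(doc)[term] for term in vocab] for doc in docs]

def idf(term):
    """Inverse document frequency: log(N / number of documents containing term)."""
    doc_freq = sum(1 for doc in docs if term in doc)
    return math.log(len(docs) / doc_freq)

# TF-IDF: term frequency (count / document length) scaled by IDF.
tfidf = [
    [count / len(doc) * idf(term) for count, term in zip(row, vocab)]
    for row, doc in zip(counts, docs)
]

print(ngrams(docs[0], 2))
print(vocab)
print(counts)
```

A term that appears in every document, such as "the" here, gets an IDF of zero and therefore contributes nothing to the TF-IDF vector, while a term unique to one document is weighted most heavily.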
Algorithms/Libraries
Machine Learning: Scikit-Learn
Feature Engineering
- Categorical Encoding
- Feature Evaluation
- Feature Transformation
- Feature Generation
- Feature Selection
Model Building, Training and Prediction
- Random Forest
- Cross Validation
- Gradient Boosting with Grid Search
Deep Learning: Keras, TensorFlow, PyTorch, fast.ai
- Dense Classifier
- Convolutional Classifier
- Recurrent Classifier
- LSTM Classifier
- Bi-Directional LSTM Classifier
- Stacked Recurrent Classifier
- Seq2seq
- Attention
- Transfer Learning
- GRU
- Functional API – Multi-ConvNet
AutoML: GCP, H2O.ai
Model Evaluation
ROC Curve
Confusion Matrix
Accuracy
Precision
Recall
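The confusion matrix, accuracy, precision, and recall listed above all derive from the same four counts. The sketch below computes them by hand for a binary problem; the `y_true` and `y_pred` lists are made-up labels for illustration only. The ROC curve is omitted because it needs predicted scores or probabilities rather than hard labels.

```python
# Hypothetical binary labels: 1 = positive class, 0 = negative class.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

pairs = list(zip(y_true, y_pred))
tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

# Confusion matrix with rows = actual class, columns = predicted class.
confusion = [[tn, fp],
             [fn, tp]]

accuracy = (tp + tn) / len(pairs)   # share of all predictions that are correct
precision = tp / (tp + fp)          # share of predicted positives that are correct
recall = tp / (tp + fn)             # share of actual positives that were found

print(confusion)
print(accuracy, precision, recall)
```

Precision and recall divide by zero when a model predicts no positives or the data contains none, so production code usually guards those cases.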
References:
https://learning.oreilly.com/library/view/deep-learning-illustrated/9780135116821/
https://learning.oreilly.com/library/view/hands-on-machine-learning/9781492032632/
https://www.kaggle.com/learn/natural-language-processing
https://learning.oreilly.com/videos/deep-learning-for/9780134851921