Real World ML Pipeline:
- Understanding of business problem
- Problem formalization
- Data collecting
- Data preprocessing
- Modelling
- Way to evaluate model in real life
- Way to deploy model
Real World Aspect:
- Competition Problem formalization
- Choice of target metric
- Deployment issues
- Inference speed
- Data collecting
- Model complexity
- Target metric value
Competition Aspect:
Competition Problem formalization (N)Choice of target metric (N)Deployment issues (N)Inference speed (N)- Data collecting (Y/N)
- Model complexity (Y/N)
- Target metric value (Y)
Recap of main ML algorithms:
Overview of ML methods:
- Scikit-Learn (or sklearn) library
- Overview of k-NN (sklearn's documentation)
- Overview of Linear Models (sklearn's documentation)
- Overview of Decision Trees (sklearn's documentation)
- Overview of algorithms and parameters in H2O documentation
Additional Tools:
- Vowpal Wabbit repository
- XGBoost repository
- LightGBM repository
- Interactive demo of simple feed-forward Neural Net
- Frameworks for Neural Nets: Keras, PyTorch, TensorFlow, MXNet, Lasagne
- Example from sklearn with different decision surfaces
- Arbitrary order factorization machines
- Basic SciPy stack (ipython, numpy, pandas, matplotlib)
- Jupyter Notebook
- Stand-alone python tSNE package
- Libraries to work with sparse CTR-like data: LibFM, LibFFM
- Another tree-based method: RGF (implemetation, paper)
- Python distribution with all-included packages: Anaconda
- Blog "datas-frame" (contains posts about effective Pandas usage)
Feature preprocessing:
- Preprocessing in Sklearn
- Andrew NG about gradient descent and feature scaling
- Feature Scaling and the effect of standardization for machine learning algorithms
Feature generation:
- Discover Feature Engineering, How to Engineer Features and How to Get Good at It
- Discussion of feature engineering on Quora
- Bag of words
- Word2vec
NLP Libraries:
Feature extraction from images:
Truly mind blowing blog went amazed with the subject they have developed the content. These kind of posts really helpful to gain the knowledge of unknown things which surely triggers to motivate and learn the new innovative contents. Hope you deliver the similar successive contents forthcoming as well.
回覆刪除Stupendous blog huge applause to the blogger and hoping you to come up with such an extraordinary content in future. Surely, this post will inspire many aspirants who are very keen in gaining the knowledge. Expecting many more contents with lot more curiosity further.
data science certification in bhilai