Spark朴素贝叶斯(Naive Bayes)分类理论和源码分析
https://nlp.stanford.edu/IR-book/html/htmledition/naive-bayes-text-classification-1.html
https://nlp.stanford.edu/IR-book/html/htmledition/properties-of-naive-bayes-1.html
https://nlp.stanford.edu/IR-book/html/htmledition/the-bernoulli-model-1.html
拉格朗日求解模型参数 https://zhuanlan.zhihu.com/p/71960086
平滑: 伯努利:分子+1,分母+2。每个词只有{0,1}两个状态,因此是2 多项式:分子+1, 分母每个词都要+1
LDA:
- https://www.cnblogs.com/breezedeus/archive/2013/01/20/2868930.html https://blog.csdn.net/u011414416/article/details/51168242
- LDA数学八卦
- spark源码 https://endymecy.gitbooks.io/spark-ml-source-analysis/content/聚类/LDA/docs/docs/docs/docs/docs/docs/On%20Smoothing%20and%20Inference%20for%20Topic%20Models.pdf
分布式增量计算均值、方差 Spark:
- https://blog.csdn.net/snaillup/article/details/75208638
- https://endymecy.gitbooks.io/spark-ml-source-analysis/content/%E5%9F%BA%E6%9C%AC%E7%BB%9F%E8%AE%A1/summary-statistics.html
谱聚类:
- https://www.cnblogs.com/pinard/p/6221564.html
条件高斯分布 + 贝叶斯定理
- 滴滴“猜您去哪”的算法实现 http://www.semocean.com/lbs%e7%8c%9c%e4%bd%a0%e5%8e%bb%e5%93%aa%e5%84%bf%e5%8a%9f%e8%83%bd%e5%ae%9e%e7%8e%b0/ http://www.sohu.com/a/158543230_355140
搜索推荐相关
- 阿里的TDM模型和JTM : https://mp.weixin.qq.com/s/QThtXNF9oIE5bXF5kaSg5w
深度学习 相关
- Attention: https://mp.weixin.qq.com/s/MzHmvbwxFCaFjmMkjfjeSg
- 矩阵微积分:https://explained.ai/matrix-calculus/index.html(或者下载pdf)
L2 Norm:L2范数归一化
gensim的w2v_model.wv.syn0norm就是L2归一化后的向量 https://www.cnblogs.com/Kalafinaian/p/11180519.html
信息量、熵、交叉熵、互信息PMI
https://blog.csdn.net/pipisorry/article/details/51695283
梯度提升树,xgboost
https://mp.weixin.qq.com/s/7n1nzGL7r789P9sv0GEkDA 并行:http://zhanpengfang.github.io/418home.html
广告归因
http://www.sohu.com/a/121493914_495674 https://zhuanlan.zhihu.com/p/37530954
评估指标
- AUC:https://zhuanlan.zhihu.com/p/52930683
python 特征处理
https://www.kesci.com/home/project/5d86eced8499bc002c108cc8
损失函数求导
- 二分类-交叉熵 https://blog.csdn.net/jasonzzj/article/details/52017438
- softmax-交叉熵 https://blog.csdn.net/qian99/article/details/78046329
百面机器学习
https://github.com/linglu/baimian https://www.jianshu.com/p/576595f9a16b pdf
CS231n
https://zhuanlan.zhihu.com/p/21930884
CS224D
https://blog.csdn.net/pirage https://blog.csdn.net/NeighborhoodGuo/category_5631741.html https://blog.csdn.net/yaoqiang2011/category_6256954.html https://blog.csdn.net/longxinchen_ml/category_6256967.html http://cs224d.stanford.edu/syllabus.html 学习/斯坦福课程