
Data Mining: Naive Bayes Classifier

Author: Cache_wood | Published 2022-04-10 16:28

@[toc]

A probabilistic framework for solving classification problems

Conditional Probability:
P(C|A) = \frac{P(A,C)}{P(A)}
P(A|C) = \frac{P(A,C)}{P(C)}
Bayes theorem:
P(C|A) = \frac{P(A|C)P(C)}{P(A)}
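
Bayes theorem follows directly from the two conditional probability identities above: both express the joint probability P(A,C), so

P(C|A)P(A) = P(A,C) = P(A|C)P(C) \implies P(C|A) = \frac{P(A|C)P(C)}{P(A)}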
Consider each attribute and class label as random variables

Given a record with attributes (A_1,A_2,…,A_n)

  • Goal is to predict class C
  • Specifically, we want to find the value of C that maximizes P(C|A_1,A_2,…,A_n)

Approach:

  • Compute the posterior probability P(C|A_1,A_2,…,A_n) for all values of C using Bayes theorem
    P(C|A_1A_2…A_n) = \frac{P(A_1A_2…A_n|C)P(C)}{P(A_1A_2…A_n)}

  • Choose value of C that maximizes
    P(C|A_1,A_2,…,A_n)

  • Equivalent to choosing value of C that maximizes
    P(A_1,A_2,…,A_n|C)P(C)
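
The equivalence holds because the evidence P(A_1,A_2,…,A_n) is the same for every class, so it can be dropped when ranking classes:

\arg\max_C P(C|A_1,…,A_n) = \arg\max_C \frac{P(A_1,…,A_n|C)P(C)}{P(A_1,…,A_n)} = \arg\max_C P(A_1,…,A_n|C)P(C)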

Naive Bayes Classifier

Assume independence among attributes A_i when class is given:

  • P(A_1,A_2,…,A_n|C_j) = P(A_1|C_j)P(A_2|C_j)…P(A_n|C_j)
  • Can estimate P(A_i|C_j) for all A_i and C_j.
  • New point is classified to C_j if P(C_j)\Pi P(A_i|C_j) is maximal.
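
A minimal Python sketch of this classification rule for categorical attributes, assuming a small in-memory training set; the `train`/`classify` helpers and the toy weather-style data are illustrative, not from the original notes:

```python
from collections import Counter, defaultdict

def train(records, labels):
    """Estimate P(C_j) and the counts needed for P(A_i = v | C_j) from categorical data."""
    class_counts = Counter(labels)
    # cond_counts[(i, value, c)] = number of class-c records whose i-th attribute equals value
    cond_counts = defaultdict(int)
    for record, c in zip(records, labels):
        for i, value in enumerate(record):
            cond_counts[(i, value, c)] += 1
    priors = {c: n / len(labels) for c, n in class_counts.items()}
    return priors, cond_counts, class_counts

def classify(record, priors, cond_counts, class_counts):
    """Return the class C_j that maximizes P(C_j) * prod_i P(A_i | C_j)."""
    best_class, best_score = None, -1.0
    for c, prior in priors.items():
        score = prior
        for i, value in enumerate(record):
            score *= cond_counts[(i, value, c)] / class_counts[c]
        if score > best_score:
            best_class, best_score = c, score
    return best_class

# toy usage
records = [("sunny", "hot"), ("rainy", "cool"), ("sunny", "cool"), ("rainy", "hot")]
labels = ["no", "yes", "yes", "no"]
priors, cond_counts, class_counts = train(records, labels)
print(classify(("sunny", "cool"), priors, cond_counts, class_counts))  # -> "yes"
```

Note that these unsmoothed ratio estimates can hit the zero-probability problem discussed below.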

How to Estimate Probabilities from Data

For continuous attributes:

  • Discretize the range into bins
    • one ordinal attribute per bin
    • violates independence assumption
  • Two-way split: (A<v) or (A>v)
    • choose only one of the two splits as the new attribute
  • Probability density estimation
    • Assume attribute follows a normal distribution
    • Use data to estimate the parameters of the distribution (e.g., mean and standard deviation)
    • Once probability distribution is known, can use it to estimate the conditional probability P(A_i|c)

Normal distribution: P(A_i|c_j) = \frac{1}{\sqrt{2\pi\sigma_{ij}^2}}e^{-\frac{(A_i-\mu_{ij})^2}{2\sigma_{ij}^2}}

One distribution for each (A_i, c_j) pair
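
A minimal sketch of this density estimate, assuming the class-conditional sample mean and standard deviation for A_i have already been computed; the `gaussian_likelihood` helper and the numbers in the example are illustrative:

```python
import math

def gaussian_likelihood(x, mean, std):
    """P(A_i = x | c_j) under a normal distribution with class-conditional mean and std."""
    var = std ** 2
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)

# e.g. a continuous attribute with class-conditional mean 110 and std 54.54 (illustrative values)
print(gaussian_likelihood(120, mean=110, std=54.54))  # ~0.0072
```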

If one of the conditional probabilities is zero, the entire expression becomes zero

Probability estimation:

N_c: number of training records of class C, N_{ic}: number of class-C records with attribute value A_i, c: number of classes, p: prior probability, m: parameter

Original: P(A_i|C) = \frac{N_{ic}}{N_c}
Laplace: P(A_i|C) = \frac{N_{ic}+1}{N_c+c}
m-estimate: P(A_i|C) = \frac{N_{ic}+mp}{N_c+m}
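
A minimal sketch of the three estimators, assuming the counts N_{ic} and N_c are already available; function and parameter names are illustrative:

```python
def original_estimate(n_ic, n_c):
    """P(A_i|C) = N_ic / N_c -- zero whenever the value never occurs with class C."""
    return n_ic / n_c

def laplace_estimate(n_ic, n_c, c):
    """Laplace correction, with c as defined above (number of classes)."""
    return (n_ic + 1) / (n_c + c)

def m_estimate(n_ic, n_c, m, p):
    """m-estimate: p is a prior for P(A_i|C), m controls how strongly the prior is weighted."""
    return (n_ic + m * p) / (n_c + m)

# with a zero count the smoothed estimates stay positive, so the product is not wiped out
print(original_estimate(0, 10))         # 0.0
print(laplace_estimate(0, 10, c=3))     # ~0.077
print(m_estimate(0, 10, m=3, p=1/3))    # ~0.077
```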

Naive Bayes (Summary)

  • Robust to isolated noise points
  • Handles missing values by ignoring the instance during probability estimate calculations
  • Robust to irrelevant attributes
  • Independence assumption may not hold for some attributes
    • Use other techniques such as Bayesian Belief Networks (BBN)
