* They are able to "explain away" missing inputs.

* Examples: Naive Bayes, Mixtures of Gaussians, HMMs, Bayesian Networks, Markov Random Fields

* **Discriminative models** do everything in one step -- they learn the posterior p(y|x) directly.

* They are simpler and can use many more features, but they handle missing inputs poorly.

* Examples: SVMs, Logistic Regression, Neural Networks, k-NN, Conditional Random Fields
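
The contrast in the bullets above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the class `GaussianNaiveBayes`, the function `logistic_posterior`, the toy data, and the logistic weights are all assumptions made up for this example. The generative model stores p(y) and p(x_i | y), so a missing feature (marked `None` here) can simply be marginalized out. The discriminative model maps a complete feature vector straight to p(y = 1 | x) and has no distribution over inputs to fall back on.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


class GaussianNaiveBayes:
    """Generative sketch: learns the prior p(y) and per-feature p(x_i | y)."""

    def fit(self, X, y):
        self.classes = sorted(set(y))
        self.prior = {}
        self.stats = {}
        for c in self.classes:
            rows = [x for x, label in zip(X, y) if label == c]
            self.prior[c] = len(rows) / len(X)
            self.stats[c] = []
            for j in range(len(X[0])):
                col = [r[j] for r in rows]
                mean = sum(col) / len(col)
                # Small constant keeps the variance strictly positive.
                var = sum((v - mean) ** 2 for v in col) / len(col) + 1e-6
                self.stats[c].append((mean, var))
        return self

    def posterior(self, x):
        """Return p(y | observed features); None marks a missing feature."""
        joint = {}
        for c in self.classes:
            p = self.prior[c]
            for value, (mean, var) in zip(x, self.stats[c]):
                # A missing feature integrates to 1 under p(x_i | y),
                # so it drops out of the product.
                if value is not None:
                    p *= gaussian_pdf(value, mean, var)
            joint[c] = p
        total = sum(joint.values())
        return {c: p / total for c, p in joint.items()}


def logistic_posterior(x, weights, bias):
    """Discriminative sketch: computes p(y = 1 | x) directly from all features."""
    if any(v is None for v in x):
        # No model of the inputs exists, so a gap must be imputed beforehand.
        raise ValueError("missing feature: impute before predicting")
    z = bias + sum(w * v for w, v in zip(weights, x))
    return 1.0 / (1.0 + math.exp(-z))


# Toy data (hypothetical): two features, two well-separated classes.
X = [[1.0, 2.0], [1.2, 1.8], [0.8, 2.2], [3.0, 0.5], [3.2, 0.7], [2.8, 0.3]]
y = [0, 0, 0, 1, 1, 1]
model = GaussianNaiveBayes().fit(X, y)
```

Calling `model.posterior([1.1, None])` still returns a usable posterior from the first feature alone, and `model.posterior([None, None])` falls back to the class prior. `logistic_posterior([1.1, None], [1.0, -1.0], 0.0)` raises instead, because the missing value would first have to be imputed by some separate mechanism.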

- Each CFG rule generates just one level of the derivation tree. Therefore, using
