Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
courses:rg:2013:memm [2013/03/19 00:05] popel |
courses:rg:2013:memm [2014/10/12 15:04] (current) popel |
||
---|---|---|---|
Line 1: | Line 1: | ||
- | ===== Maximum Entropy Markov Models ===== | + | ===== Maximum Entropy Markov Models |
- | **1.** Explain (roughly) how the new formula for α_t+1(s) is derived (i.e. formula 1 in the paper). | + | 1. Explain (roughly) how the new formula for α_t+1(s) is derived (i.e. formula 1 in the paper). |
- | **2.** Section 2.1 states "we will split P(s|s', | + | 2. Section 2.1 states "we will split P(s|s', |
- | **3.** Let S= {V,N} (verb and non-verb) | + | 3. Let S= {V,N} (verb and non-verb) |
Training data = he/N can/V can/V a/N can/N | Training data = he/N can/V can/V a/N can/N | ||
// | // | ||
Line 13: | Line 13: | ||
When implementing MEMM you need to define s_0, i.e. the previous state before the first token. It may be a special NULL, but for simplicity let’s define it as N. | When implementing MEMM you need to define s_0, i.e. the previous state before the first token. It may be a special NULL, but for simplicity let’s define it as N. | ||
a) What are the states (s) and observations (o) for this training data? | a) What are the states (s) and observations (o) for this training data? | ||
- | b) Equation (2) defines features f_a based on // | + | |
+ | b) Equation (2) defines features f_a based on // | ||
c) Equation (3) defines constraints. How many such constraints do we have? | c) Equation (3) defines constraints. How many such constraints do we have? | ||
+ | |||
d) List all the constraints involving feature b2, i.e. substitute (whenever possible) concrete numbers into Equation (3). | d) List all the constraints involving feature b2, i.e. substitute (whenever possible) concrete numbers into Equation (3). | ||
+ | |||
e) In step 3 of the GIS algorithm you need to compute < | e) In step 3 of the GIS algorithm you need to compute < | ||