---
layout: linguistics
title: linguistics/syntax
---

# morphology and syntax

Morphology is the study of word formation. Syntax is the study of sentence formation.<br>
Specifically, both morphology and syntax focus on **structure**.

The distinction between syntax and morphology varies cross-linguistically.<br>
They can be considered to form an overarching **morphosyntactic** theory.

These notes are ordered in the way that I feel builds upon itself best. This is not the order in which topics were covered in my syntax class, nor in my textbook. My syntax class covered Agree before Move, and my textbook deeply intertwined Merge with X'-theory and Move with Agree. I think both of them suffered a little pedagogically for that.

Certainly, all of syntax cannot be taught at once. Yet the desire to generalize and apply what one has learned to real-world examples is strong, and it is extraordinarily difficult to teach syntax in a way that builds upon itself naturally. This is my best attempt, but it will fall flat in places: when it does, I do recommend either skipping ahead or being content with temporarily (hopefully temporarily) not knowing what's going on.

<details markdown="block">
<summary>Table of Contents</summary>

- History of Syntax
  - A wrong approach: Phrase Structure Rules
- Morphology [SKS 2]
  - Syntactic Categories [SKS 2.1]
  - Compositionality [SKS 2.3]
  - Headedness [SKS 2.4]
- Merge, Part I
  - Constituency [SKS 3]
    - Constituency Tests [SKS 3]
  - Heads, Specifiers and Complements
  - Silent Heads
- Notation
  - X'-theory [SKS 6]
  - Bare Phrase Structure [n/a]
  - Lexical Entries [SKS 6.8]
- Minimalism [n/a]
- Merge, Part II
  - Projection [SKS 5]
  - Selection
    - Small Clauses
- Move [SKS 8]
  - Head Movement (affix hopping) [SKS 8.3]
  - Questions
  - vP Shells [SKS 12.4]
- Agree
  - Theta Roles (assigned by verbs)
  - Locality (c-command)
  - Binding (Principles A, B, C) [SKS 7]
    - Small Clauses [SKS 7.4]
  - Raising & Control [SKS 9]
- Advanced Syntax
- References

</details>

> Be warned! These notes are incomplete and almost certainly somewhat inaccurate. Proceed at your own risk.

## history

### a wrong approach: phrase structure rules

## morphology

### syntactic categories

### compositionality

### headedness

## Merge, Part I

We concluded the following from our excursion into morphology:
- words are composed of morphemes
- morphemes come in categories
- morphemes combine in a regular fashion
- morphemes can be silent

Surprisingly (or unsurprisingly), we shall see that these ideas generalize to sentence structure as a whole.

### constituency

### heads, specifiers, and complements

### silent heads

Why are proper names Ds? Why is it possible to say both *I moved the couches* and *I moved couches*, but only possible to say *I moved the couch* and not *I moved couch*? Why is the infinitive form of a verb identical to the present, in some cases?

These inconsistencies can all be addressed by one (controversial) concept: the idea of *silent morphemes*, invisible in writing and unpronounced in speech. We represent such morphemes as ∅, and so may write the earlier strange sentence as *I moved ∅-couches*.

...

p-features | f-features
-----------|-----------
the | $D_{N}$
a | $D_{N (-plural)}$
∅ | $D_{N (+plural)}$

p-features | f-features | s-features
-----------|------------|-----------
will | $T_{D,V}$ | future
-ed | $T_{D,V}$ | past
∅ | $T_{D,V}$ | present
to | $T_{D,V} (-tense)$ | infinitive

These tables are using notation and language formally introduced at the end of the next section. Ignore them for now.

## notation

So far, we've been discussing syntax and giving examples using somewhat informal notation. We now formalize this notation.

### X'-theory

**X'-theory** (x-bar theory) is a notation originally put forth by Chomsky...

...

### Bare Phrase Structure

**Bare Phrase Structure** (BPS) is a more modern notation that does away with much of the notational cruft of X'-theory. Instead of bar levels and distinctions between bar levels and "phrases", we simply put the *formal features* of our lexicon in the tree itself and only indicate the *types* of phrases. Whether a phrase has yet to close (previously indicated by a bar) is now indicated by whether there are any unsatisfied selectional requirements on the phrase label.

**Head-Initial Phrases**

![`[X [X_Y (head)] [Y (complement)]]`](head-initial.png)
<details markdown="block">
<summary>LaTeX</summary>

```forest
\begin{forest}
[$X$
  [$X_Y$ [(head)]]
  [$Y$ [(complement)]]]
\end{forest}
```

</details>

**Head-Final Phrases**

![`[X [Y (complement)] [X_Y (head)]]`](head-final.png)
<details markdown="block">
<summary>LaTeX</summary>

```forest
\begin{forest}
[$X$
  [$Y$ [(complement)]]
  [$X_Y$ [(head)]]]
\end{forest}
```

</details>

Recall that adjuncts are able to appear on either side of their head. Also recall that adjuncts *select* for their head. We indicate this in our labeling: adjuncts, like heads, have their selectional requirements marked, but do not propagate their type. While certain constructions may lead to notational ambiguity - an adjunct and a head of the same type, specifically - this is rare enough (only really occurring with adverbs) that we take the convenience of BPS notation regardless.

**Left Adjuncts**

![`[X [Y_X (left adjunct)] [X (head)]]`](left-adjunct.png)
<details markdown="block">
<summary>LaTeX</summary>

```forest
\begin{forest}
[$X$
  [$Y_X$ [(left adjunct)]]
  [$X$ [(head)]]]
\end{forest}
```

</details>

**Right Adjuncts**

![`[X [X (head)] [Y_X (right adjunct)]]`](right-adjunct.png)
<details markdown="block">
<summary>LaTeX</summary>

```forest
\begin{forest}
[$X$
  [$X$ [(head)]]
  [$Y_X$ [(right adjunct)]]]
\end{forest}
```

</details>

As a reminder, English is not consistently head-initial. Subjects ("specifiers") in English appear before the verb, breaking this convention. This common structure is represented as the following:

![`[X [Y (specifier)] [X_Y [X_{Y,Z} (head)] [Z (complement)]]]`](english-specifier.png)
<details markdown="block">
<summary>LaTeX</summary>

```forest
\begin{forest}
[$X$
  [$Y$ [(specifier)]]
  [$X_Y$
    [$X_{Y,Z}$ [(head)]]
    [$Z$ [(complement)]]]]
\end{forest}
```

</details>

The lexicon and structure are blended in bare phrase structure. This is useful, and allows us to indicate more specific selectional requirements on the tree itself.

...

It cannot be emphasized enough that notational conventions are *just that*: notational conventions. There's nothing stopping us from exclusively using X'-notation or exclusively using BPS, and syntactic concepts are *not* tied to any specific notation. I will pretty much exclusively use BPS going forth as I like it a whole lot more.

### lexical entries

We have stated that Bare Phrase Structure pulls aspects of the lexicon directly into the syntax tree. But what is a lexicon?

A **lexicon** is a language speaker's internal collection of lexical entries. But what is a lexical entry?

What exactly a lexical entry contains is up for some debate. The English language consists of (significantly) upwards of 400,000 words. How can humans hold that much information in their minds, and retrieve it so quickly? This is biologically interesting, and there are arguments from such fields for what lexical entries should and should not contain. For our purposes, we will focus entirely on syntactic analysis, and ignore biological motivations. We treat a **lexical entry** as containing the following information about an associated morpheme:
- phonetic features (**p-features**): how the word is pronounced
  - With our focus on syntax, we shall simply consider this the standard written representation of the morpheme. But it should really be written in IPA.
- formal features (**f-features**): the type of the morpheme and what types it selects, if any
  - These are often written directly on our tree in BPS. Most often they are simply the types of the arguments, but heads can select for much more granular features, e.g. -tense or +animacy.
- semantic features (**s-features**): the role of the entry and its arguments in the sentence
  - Not all lexical entries have s-features. For tense/aspect/etc, these are their appropriate tense/aspect/etc. For verbs, these are typically *theta roles* (which we shall address later).
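
The three kinds of features above can be sketched as a small record type. This is only an illustration under my own assumptions: the class name `LexicalEntry`, its field names, and the `label` helper are made up for this sketch, the written form stands in for IPA, and granular features like -tense are left out.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class LexicalEntry:
    p_features: str                   # pronunciation; the written form stands in for IPA
    category: str                     # f-features: the type of the morpheme
    selects: Tuple[str, ...] = ()     # f-features: the types it selects, if any
    s_features: Optional[str] = None  # semantic contribution; not every entry has one

    def label(self) -> str:
        """The BPS-style label written on the tree, e.g. T_{D,V}."""
        if not self.selects:
            return self.category
        return self.category + "_{" + ",".join(self.selects) + "}"


# Entries transcribed from the tables in the silent heads section.
LEXICON = [
    LexicalEntry("the", "D", ("N",)),
    LexicalEntry("will", "T", ("D", "V"), "future"),
    LexicalEntry("-ed", "T", ("D", "V"), "past"),
    LexicalEntry("∅", "T", ("D", "V"), "present"),
]

for entry in LEXICON:
    print(entry.p_features, entry.label(), entry.s_features)
```

Keeping the selected types as an ordered tuple is a choice made for this sketch: it lets the label be printed exactly as it would appear on a BPS tree.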

## Minimalism

[Minimalism](https://en.wikipedia.org/wiki/Minimalist_program) is a *program* that aims to reduce much of the complexity surrounding syntactic analysis. While our theories may end up providing adequate analyses of natural languages, this is not enough. Phrase structure rules, too, were *adequate*: yet we rejected them for their sheer complexity. If we can explain what we observe in a simpler framework, *we should adopt that framework*. Many modern advancements in syntactic analysis have come out of Minimalism: bare phrase structure, in particular.

As with most Chomskyan theories, Minimalism has a *strong* focus on natural language facilities. A core thesis is that *"language is an optimal solution to legibility conditions"*. I don't find this interesting, so I won't get into it, and instead will focus on the definitions and usage of the basic operations rather than the motivation for them.

Modern Minimalism considers three *basic operations*: <span style="font-variant: small-caps;">Merge</span>, <span style="font-variant: small-caps;">Move</span>, and <span style="font-variant: small-caps;">Agree</span>. All that we will discuss falls into one of these basic camps.

## Merge, Part II

<span style="font-variant: small-caps;">Merge</span>(α, β) is a function that takes in two arguments of type α and β and outputs a single node of either type α or β.

Merge is *the* fundamental underlying aspect of syntax and arguably language as a whole. Compositionality, headedness, movement (in some camps), and a whole lot more can be considered to be rolled into it.
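
To make the "function" framing concrete, here is a minimal sketch of Merge over BPS-style labels. It rests on assumptions of mine rather than on any standard formalization: the names `Node` and `merge` are hypothetical, a head discharges its selectional requirements right to left (complement first, as in the trees in these notes), only closed phrases can be selected, and adjuncts and movement are ignored entirely.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Node:
    category: str                      # the type that projects, e.g. "T"
    selects: Tuple[str, ...] = ()      # unsatisfied selectional requirements
    children: Tuple["Node", ...] = ()  # empty for leaves
    word: Optional[str] = None         # only set on leaves

    @property
    def label(self) -> str:
        if not self.selects:
            return self.category
        return self.category + "_{" + ",".join(self.selects) + "}"


def merge(a: Node, b: Node) -> Node:
    """Combine two nodes; whichever one selects the other projects its type.

    Word order is preserved: `a` stays on the left, `b` on the right.
    """
    for head, other in ((a, b), (b, a)):
        # Assumption: requirements are discharged right to left, and only a
        # closed phrase (no requirements left) can be selected.
        if head.selects and head.selects[-1] == other.category and not other.selects:
            return Node(head.category, head.selects[:-1], (a, b))
    raise ValueError(f"cannot merge {a.label} with {b.label}")


# "Alice will speak to the assembly", built bottom-up.
the = Node("D", ("N",), word="the")
assembly = Node("N", word="assembly")
to = Node("P", ("D",), word="to")
speak = Node("V", ("P",), word="speak")
will = Node("T", ("D", "V"), word="will")
alice = Node("D", word="Alice")

t_bar = merge(will, merge(speak, merge(to, merge(the, assembly))))
tp = merge(alice, t_bar)
print(t_bar.label, tp.label)  # T_{D} T
```

Merging two nodes where neither selects the other, such as `merge(alice, assembly)`, raises a `ValueError` in this sketch, which is a crude stand-in for an ungrammatical combination.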

### projection

### selection

## Move

### head movement

### wh-movement

### vP shells

Consider the following sentence: *Alice will speak to the assembly*. With our current knowledge of syntax, we would diagram it as so:

![`[T [D Alice] [T_D [T_{D,V} will] [V [V_P speak] [P [P_D to] [D [D_N the] [N assembly]]]]]]`](no-subject-movement.png)
<details markdown="block">
<summary>LaTeX</summary>

```forest
\begin{forest}
  [$T$
    [$D$ [Alice, roof]]
    [$T_D$
      [$T_{D,V}$ [will, roof]]
      [$V$
        [$V_P$ [speak]]
        [$P$
          [$P_D$ [to]]
          [$D$
            [$D_N$ [the]]
            [$N$ [assembly]]]]]]]
\end{forest}
```

</details>

The $D$ *Alice* here is the subject. Replacing it with some $D$s produces grammatical sentences, e.g. *The prime minister will speak to the assembly*, but this is not true of all $D$s. Slotting in inanimate $D$s, as in *Time will speak to the assembly* and *Knowledge will speak to the assembly*, produces grammatically unacceptable sentences. So there is some *selection* occurring somewhere in the sentence that wants a particular *feature set* (f-features) from the subject $D$.

Observe, however, that our tree structure suggests that $T$ - and only $T$ - is involved in the selection of *Alice* as the subject, given locality of selection. But this can't be quite right. Plenty of other sentences involving the $T$ *will* are just fine with inanimate subjects: *Time will pass*, *Knowledge will be passed on*, etc. (Notice that *Alice will pass* and *Alice will be passed on* are similarly ungrammatical.) How do we reconcile this?

We now introduce the idea of $vP$ shells and V-to-T movement. Our observations above point towards the $V$ of the sentence rather than the $T$ selecting for the subject $D$ - somehow. This selection would break our guiding principle of locality of selection. But this behavior *does* occur. Can we extend our model to explain this, *without* modifying the locality of selection that has been so useful thus far? We can, indeed, and illustrate so in the following tree.

![`[T [D Alice] [T_D [T_{D,V} will] [V [D (subj)] [V_D [V_{D,P} speak] [P [P_D to] [D [D_N the] [N assembly]]]]]]]`](subject-movement.png)
<details markdown="block">
<summary>LaTeX</summary>

```forest
\begin{forest}
  [$T$
    [$D$ [Alice, roof, name=Alice]]
    [$T_D$
      [$T_{D,V}$ [will, roof]]
      [$V$
        [$D$ [(subj), roof, name=subj]]
        [$V_D$
          [$V_{D,P}$ [speak]]
          [$P$
            [$P_D$ [to]]
            [$D$
              [$D_N$ [the]]
              [$N$ [assembly]]]]]]]]
    \draw[->,draw opacity=0.5] (subj) to[out=west,in=south] (Alice);
\end{forest}
```

</details>

So we say that *Alice* is originally selected by the $V$ and *moves* to its surface position in the $T$. Our head movement principles allow for this. This does mean that every tree diagram we have drawn up until now is inaccurate, and that almost every tree we draw going forward will have to have this V-to-T movement. This is a fine tradeoff to make in exchange for accurately describing previously-unclear syntactic behavior.

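> Note: strictly speaking, this is not V-to-T movement, which refers to head movement of the verb itself up to $T$. The idea that the subject originates inside the verb phrase and moves to the specifier of $T$ is usually called the *VP-internal subject hypothesis*.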

### small clauses

## Agree

### theta roles

### locality

### binding

How do pronouns work?

...

The theory of binding operates under three fundamental principles.
- **Principle A**: an anaphor must be bound in its domain.
- **Principle B**: a pronoun must be free in its domain.
- **Principle C**: an r-expression may never be bound.

Our principles imply various things. Principle A implies that:
- a reflexive must be coreferential with its antecedent
- the antecedent of a reflexive must c-command the reflexive
- the reflexive and its antecedent must be in all the same nodes that have a subject
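
The c-command condition can be checked mechanically on a tree. The sketch below assumes the usual textbook definition, which these notes have not spelled out yet: a node c-commands another if neither dominates the other and the first branching node above the first also dominates the second. The `Node` class and `c_commands` function are hypothetical names, and the tree for *Alice will see herself* is simplified (no subject movement).

```python
from typing import List, Optional


class Node:
    def __init__(self, label: str, children: Optional[List["Node"]] = None):
        self.label = label
        self.children = children or []
        self.parent: Optional["Node"] = None
        for child in self.children:
            child.parent = self

    def dominates(self, other: "Node") -> bool:
        """True if `other` is strictly below this node."""
        node = other.parent
        while node is not None:
            if node is self:
                return True
            node = node.parent
        return False


def c_commands(a: Node, b: Node) -> bool:
    """Assumed definition: neither node dominates the other, and the first
    branching node above `a` also dominates `b`."""
    if a is b or a.dominates(b) or b.dominates(a):
        return False
    node = a.parent
    while node is not None and len(node.children) < 2:
        node = node.parent
    return node is not None and node.dominates(b)


# Simplified tree: [T [D Alice] [T_D [T will] [V [V see] [D herself]]]]
alice = Node("D Alice")
herself = Node("D herself")
see = Node("V see")
will = Node("T will")
vp = Node("V", [see, herself])
t_bar = Node("T_D", [will, vp])
tp = Node("T", [alice, t_bar])

print(c_commands(alice, herself))  # True
print(c_commands(herself, alice))  # False
```

Under this definition the subject c-commands the object but not the other way around, which matches the requirement that the antecedent, not the reflexive, sits in the higher position.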


### raising and control

## Advanced Syntax

## References

- ✨ [An Introduction to Syntactic Analysis and Theory](https://annas-archive.org/md5/11bbf70ff9259025bc6985ba3fa4083b)
- MIT 24.902: [2017](https://web.mit.edu/norvin/www/24.902/24902.html), [2015](https://ocw.mit.edu/courses/24-902-language-and-its-structure-ii-syntax-fall-2015/), [2003](https://ocw.mit.edu/courses/24-902-language-and-its-structure-ii-syntax-fall-2003/)