Linguistics/Syntax
From Wikibooks, the open-content textbooks collection
Contents |
[edit] Introduction
The field of syntax[1] looks at the mental 'rules' that we have for forming sentences and phrases. In English, for instance, it is grammatical to say 'I speak English', but ungrammatical to say 'English speak I' — this is because of a rule which says that subjects normally precede verbs which precede the object.
Academic syntacticians often study either the grammar of a particular language (e.g. English Grammar), or study the various theories of a Generative Grammar — a theory which claims there is a universal underlying grammar in our heads, which different languages activate in different ways. The main concern of Generative Grammar is discovering the grammatical rules which apply to all languages, and determining how the manifest differences in world languages can be accounted for.
However, there are many competing theories which do not make — or at least remain neutral on — such a strong claim regarding a universal grammar. They include construction grammar, and cognitive linguistics. One school of thought particularly hostile to the universalist pretensions of generative grammar is the théorie des opérations enonciatives.
At the level of syntax, it is important to understand the distinction between grammaticality and semantic soundness. Sentences such as Me Tarzan, you Jane clearly make sense even though they sound "wrong". Conversely, the sentence Colorless green ideas sleep furiously (famously created by Noam Chomsky) doesn't sound "wrong" in the way the last sentence did, but it is hard to imagine anything which it would describe. We can say that these two aspects of acceptability vary independently: grammaticality, or well-formedness, gives the rules for how sentences must be constructed, but does not restrict their content, while semantic acceptibility may occur even in ill-formed sentences. When we say that something may occur grammatically, we mean that well-formedness is preserved, with complete disregard for semantics.
[edit] Units
The basic units of syntax are words and clitics.
[edit] Constituent Structure
Theories of sytax generally view sentences as being made up of constituents, which are either made up of constituents themselves or are irreducible units. Constituents which are made up of multiple words are known as phrases.
To understand the concept of a constituent, consider the sentence:
(1a) I hit the big man
Reflecting upon the meaning of this sentence, we see that there are three semantic components of what is being stated:
- There is an action occurring, which is hitting.
- The speaker is the person performing the action.
- The big man is the person undergoing the action.
Thus it seems natural that "I", "hit", and "the big man" should function as units, while "I hit the" and "hit the big" should not. The meaning of these semantic relations will be explored further in the following chapter, but suffice it to say that you probably already have some innate intuition about what a constituent is in some sense.
According to the most common analysis, sentence (1a) can be seperated into constituents like this:
(1b) (I (hit (the (big man) ) ) )
Each constituent is assigned to a category, and constituents of a category generally may always appear in the same position in a sentence. For instance, in sentence
(2a) I see _____"
Constituents of the same category as "the big man" (e.g. "the doll", "a fast, red car", "this quickly-moving snowflake") may occur in the gap, whereas constituents similar to "eats pie" (e.g. "drives fast", "writes a letter") cannot.
We will soon explore how constituent structures can be discovered, but we first must make note of their internal structure. Phrases are generally binary-branching,[2] with one of the elements making up the phrase being known as the head and the other as the complement. The head is defined to be the element which gives the phrase its defining properties. Going back to our previous example, let's look at a larger set of phrases which can be inserted into (2a);
- a cat; a dog; a fast, red car; a paper; a small toy
- the big man; the doll; the house; the zebra
- this Argentinian hyena; this umpire; this quickly-moving snowflake
- that big-scale event; that wolf
- every impressively large elephant; every person
What do these phrases all have in common? We might be inclined to say that the defining property is the presence of a noun, but this fails for our purpose. Note that the following sentence is ungrammatical:
(2b) *I see cat
The only other common element is what is known as a determiner (a, the, this, that, every, etc.). According to this analysis,[3] these constituents are headed by a determiner and have complements of the form cat; dog; fast, red car; etc. As it is conventional to name phrases after their heads, we will call these determiner phrases, or DPs for short.
[edit] Testing Constituency
There are a number of tests we can use to get an idea whether a group of words in a sentence is a constituent or not. Many of these are largely language-dependent, even though the concept of a constituent is thought to be universal.
[edit] Grammatical relations
Constituents bear grammatical relation (or grammatical function), which detemines how they behave syntactically within a proposition.
In a simplified analysis, the sentence may be viewed as generally consisting of three major constituents, which play the grammatical roles of subject, object, and verb.
[edit] Notes
- ↑ Colloquially this may be referred to as 'grammar', but this is technically incorrect, as 'grammar' also includes morphology, (as well as other aspects of linguistics in some usages).
- ↑ Whether this is always true is very controversial. Noam Chomsky's Minimalist theory states that this is always the case, but phrases such as "Andrew and Bob and Carol" create a dilemma.
- ↑ Other analyses are possible, but beyond our scope at the moment.