Bootstrapping is a term used in language acquisition in the field of linguistics. It refers to the idea that human beings are born innately equipped with a mental faculty that forms the basis of language, and that allows children to effortlessly acquire language. As a process, bootstrapping can be divided into different domains, according to whether it involves semantic bootstrapping, syntactic bootstrapping, prosodic bootstrapping, or pragmatic bootstrapping.
Origin of the term "bootstrapping"
In literal terms, a bootstrap is the small strap on a boot that is used to help pull on the entire boot. Similarly, in computer science, booting refers to the startup of an operating system by means of first initiating a smaller program. Therefore, bootstrapping is a general term used to refer to the leveraging of a small action into a more powerful and significant operation.
Bootstrapping in linguistics was first introduced by Steven Pinker as a metaphor for the idea that children are innately equipped with mental processes that help initiate language acquisition. Bootstrapping attempts to identify the language learning processes that enable children to learn about the structure of the target language.
Bootstrapping and connectionism
Bootstrapping has a strong link to connectionist theories, which model human cognition as a system of simple, interconnected networks. In this respect, connectionist approaches view human cognition as a computational algorithm. On this view, humans have statistical learning capabilities that allow them to solve problems. Proponents of statistical learning believe that it is the basis for higher-level learning, and that humans use statistical information to create a database that allows them to learn higher-order generalizations and concepts.
For a child acquiring language, the challenge is to parse out discrete segments from a continuous speech stream. Research demonstrates that, when exposed to streams of nonsense speech, children use statistical learning to determine word boundaries. In every human language, there are certain sounds that are more likely to occur with each other: for example, in English, the sequence [st] is attested (stop), but the sequence *[gb] is not.
It appears that children can detect the statistical probability of certain sounds occurring with one another, and use this to parse out word boundaries. Utilizing these statistical abilities, children appear to be able to form mental representations, or neural networks, of relevant pieces of information. Pieces of relevant information include word classes, which, in connectionist theory, are seen as each having an internal representation and transitional links between concepts. Neighbouring words provide concepts and links for children to bootstrap new representations on the basis of their previous knowledge.
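The idea of using co-occurrence statistics to find word boundaries can be illustrated with a small sketch. The code below is a toy illustration only, not the procedure used in the cited research: it estimates the transitional probability of each adjacent syllable pair in an unbroken stream and posits a word boundary wherever that probability is low. The three nonsense words, their ordering, and the 0.75 threshold are all assumptions made up for the example.

```python
from collections import Counter

def transitional_probabilities(syllables):
    """Estimate P(next | current) for every adjacent syllable pair in the stream."""
    pair_counts = Counter(zip(syllables, syllables[1:]))
    first_counts = Counter(syllables[:-1])  # how often each syllable has a successor
    return {pair: n / first_counts[pair[0]] for pair, n in pair_counts.items()}

def segment(syllables, threshold=0.75):
    """Split a non-empty stream wherever the transitional probability dips below threshold."""
    tps = transitional_probabilities(syllables)
    words, current = [], [syllables[0]]
    for prev, nxt in zip(syllables, syllables[1:]):
        if tps[(prev, nxt)] < threshold:  # unpredictable transition: likely a boundary
            words.append("".join(current))
            current = []
        current.append(nxt)
    words.append("".join(current))
    return words

# Hypothetical nonsense words, concatenated with no pauses between them.
lexicon = {"A": ["bi", "da", "ku"], "B": ["pa", "do", "ti"], "C": ["go", "la", "bu"]}
order = "ABCACBABCBAC"  # arbitrary fixed ordering of the three words
stream = [syl for w in order for syl in lexicon[w]]

print(segment(stream))
```

Within a word, each syllable is always followed by the same syllable, so the transitional probability is 1.0; across word boundaries the next syllable varies, so the probability drops and the stream is cut there.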
Bootstrapping and innateness
The innateness hypothesis was originally coined by Noam Chomsky as a means to explain the universality in language acquisition. All normally developing children with adequate exposure to a language will learn to speak and comprehend the language fluently. It is also proposed that, despite the supposed variation in languages, they all fall into a very restricted subset of the potential grammars that could be infinitely conceived. Chomsky argued that since all grammars universally deviate very little from the same subset of general structure, and since children so seamlessly acquire language, humans must have some intrinsic language learning capability that allows us to learn language. This intrinsic capability was hypothesized to be embedded in the brain, earning the title of language acquisition device (LAD). According to this, the child is equipped with knowledge of grammatical and ungrammatical types, which he then applies to the stream of speech he is hearing in order to determine the grammar this stream is compatible with. The processes underlying this LAD relate to bootstrapping in that once a child has identified the subset of the grammar he is learning, he can then apply his knowledge of grammatical types in order to learn the language-specific aspects of the word. This relates to the Principles and Parameters theory of linguistics, in that languages universally consist of basic, unbroken principles, and vary by specific parameters.
Semantic bootstrapping investigates how children use meaning to discover the structures of the language that they are acquiring.
According to Pinker, in order for children to successfully acquire a language, the following four conditions must hold:
- Condition 1: Children learn what nouns mean in order to understand and produce well-formed utterances that contain those nouns. Accordingly, upon hearing a sentence like (1) a child must know what the nouns <boy> and <dog> mean before she or he can understand or produce such a sentence.
(1) The boy is patting the dog.
- Condition 2: Children learn to associate meaning with their linguistic input by combining real world context and the meaning of individual words in the input. Accordingly, when a child sees an action - for example, the action of the boy patting the dog - the child can use this real life context to connect <the boy> to the noun phrase and <patting the dog> to the verb phrase. As a result, the child learns that <pat> refers to the action of moving your hand on something in a certain way.
- Condition 3: Language addressed to a child must be accompanied by nonsyntactic cues. Nonsyntactic cues can include various other aspects of language, such as prosody or visual cues. For example, differences in prosody are seen for function versus content words, allowing children to discover meanings based only on differences in pitch. We often see greater stress and contrast being placed on content words, such as nouns and verbs, with the greatest stress often being placed on the object and the subject. Children tend to learn these early prosodic cues within the first year of life, and often demonstrate competent knowledge of various prosodic aspects shortly after birth. Often, children are also spoken to with a lot of information placed on visual cues, such as pointing, or eye-gazing. This allows the child to gaze-follow and gain a visual representation of semantic information.
- Condition 4: Children must have innate knowledge of linguistic principles. For example, agents of transitive verbs are the subject of the sentence; in verb phrases, things that are affected by the action are typically objects; and nouns are the names of concrete objects, people, or places. By coming to the table knowing these differences, children are better able to segment out and learn the meaning of certain portions of speech given that they know the basic patterns innately.
Acquiring the state/event contrast
When discussing acquisition of temporal contrasts, the child must first have a concept of time outside of semantics. In other words, the child must be able to have some mental grasp on the concept of events, memory, and general progression of time before attempting to conceive of it semantically. Semantics, especially with regard to events and memory concepts, appears to be far more language-general, with meanings being more universal concepts rather than the individual segments being used to represent them. For this reason, semantics requires far more cognition than external stimuli in acquiring it, and relies much on the innate capability of the child to develop such abstraction, as the child must first have a mental representation of the concept, before attempting to link a word to that meaning. In order to actually learn time events, several processes must occur:
- The child must have a grasp on temporal concepts
- They must learn which concepts are represented in their own language
- They must learn how their experiences are representative of certain event types that are present in the language
- They must learn the different morphological and syntactic representations of these events
Using these basic stepping stones, the child is able to map their internal concept of the meaning of time onto explicit linguistic segments. This bootstrapping allows them to have hierarchical, segmental steps, in which they are able to build upon their previous knowledge in order to aid future learning.
Tomasello argues that in learning linguistic symbols, the child does not need to have explicit external linguistic contrasts, and rather, will learn about these concepts via social context and their surroundings. This can be demonstrated with semantic bootstrapping, in that the child does not explicitly receive information on the semantic meaning of temporal events, but learns to apply their internal knowledge of time to the linguistic segments that they are being exposed to.
Acquiring the count/mass contrast
The mapping of semantic relationships for count nouns follows the bootstrapping methods described above. Since the contexts in which children are presented with number quantities usually have visual aids to accompany them, the child has a relatively easy way to map these number concepts.
"Look at the three boys!"
In this case, granted that the child already has the mental concept for "boy" in place, she will then be able to view the multiple boys and apply her knowledge of the word and quantity of individuals to get the semantic definition.
With regard to mass, mass nouns are often thought of as being function words. They act to demonstrate the relationship between atoms of the word and substance. However, mass nouns can vary with regard to their sharpness, or the narrowness that they refer to an entity. For example, a grain of rice has a much narrower quantity definition than a bag of rice.
"Of" is a word that children are thought to learn the definition of as being something that transforms a substance into a set of atoms. For example, when you say:
"I have three gallons for sale"
"I have three gallons of water"
The word "of" is being used in the second phrase to denote the mass relationship between water and gallons. The initial substance now denotes a set. The child again is able to use visual cues in order to maintain a grasp on what this relationship is.
Syntactic bootstrapping occurs when children are able to deduce word meaning on the basis of the grammatical structure of a sentence. In this way, knowledge of grammatical structure (which includes knowledge of how phrases are organized into constituents and how these constituents combine to form sentences) "bootstraps" the acquisition of word meaning.
One of the earliest demonstrations of the existence of syntactic bootstrapping is an experiment done by Roger Brown at Harvard University in 1957. Brown showed children between the ages of three and five various pictures accompanied by nonsense words. The nonsense words included singular nouns, mass nouns, and verbs. Brown showed these pictures to a child and asked them to show him what the nonsense word referred to. When the word was placed in different positions in the sentence, the children identified different aspects of the picture. For example, when Brown wanted the child to identify a mass noun, he would ask the question in (3), and the child would point at the red confetti-like mass in the picture. To identify a verb, he would ask the question in (4). And to identify a singular noun, he would ask the question in (5).
(3) Do you see any sib?
(4) What is sibbing?
(5) Do you see a sib?
The children were for the most part able to identify the objects in the picture depending on where the word was introduced in the question that Brown provided. This shows that the structure of a sentence provides children with valuable clues as to the meaning of words.
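The way a sentence frame narrows down the category of an unknown word can be sketched as a simple lookup. The following is an illustrative toy, not a model of Brown's procedure or of children's actual processing: the three patterns are hypothetical and are modelled directly on questions (3)-(5).

```python
import re

# Hypothetical frame patterns modelled on questions (3)-(5).
# Order matters: the "any" frame is checked before the "a/an" frame.
FRAMES = [
    (re.compile(r"^do you see any \w+\?$"), "mass noun"),
    (re.compile(r"^what is \w+ing\?$"), "verb"),
    (re.compile(r"^do you see an? \w+\?$"), "singular count noun"),
]

def guess_category(question):
    """Return the category suggested by the frame, or None if no frame matches."""
    normalized = question.strip().lower()
    for pattern, category in FRAMES:
        if pattern.match(normalized):
            return category
    return None

for q in ["Do you see any sib?", "What is sibbing?", "Do you see a sib?"]:
    print(q, "->", guess_category(q))
```

The nonsense word itself carries no information here; only its position relative to words like "any", "a", and the "-ing" ending determines the guess, which is the point of the experiment.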
Acquiring lexical categories
An early demonstration of syntactic bootstrapping by Naigles (1990) involved showing 2-year-olds a video of a duck using its left hand to push a rabbit down into a squatting position while both animals wave their right arms in circles.
Initial video: Duck uses left hand to push rabbit into squatting position while both animals wave their right arms in circles
During the video, children are presented with one of the following two descriptions:
(6) Utterance A: The duck is kradding the rabbit. (describes a situation where the duck does something to the rabbit)
(7) Utterance B: The rabbit and duck are kradding. (describes a situation where the duck and the rabbit perform the same action)
Children were then presented two distinct follow-up videos.
Follow-up video 1: the duck is pushing the rabbit.
Follow-up video 2: the duck and the rabbit are both waving their arms in the air.
When instructed to "find kradding", children looked to the video that illustrated the utterance they heard during the initial video. Children who heard utterance A interpreted kradding to mean the act of the duck pushing on the rabbit, while children who heard utterance B assumed kradding was the action of arm waving. This indicates that children arrive at interpretations of a novel verb based on the utterance context and the syntactic structure in which it was embedded.
In 1990, Lila Gleitman took this idea further by examining the acquisition of verbs in more detail. In her study, she found that children could differentiate between verbs that take one or more arguments and that this knowledge was used to help them narrow down the potential meanings for the verb in question. This discovery explains how children can learn the meaning of verbs that cannot be observed, like ‘think’.
The acquisition of nouns is related to the acquisition of the mass/count contrast. In 1969, Willard Van Orman Quine claimed that children cannot learn new nouns unless they have already acquired this semantic distinction. Otherwise, the word “apples” might refer to the individual objects in a pile or the pile itself, and the child would have no way to know without already understanding the difference between a mass and a count noun. Nancy N. Soja argues that Quine is mistaken, and that children can learn new nouns without fully understanding the mass/count distinction. She found in her study that 2-year-old children were able to learn new nouns (some mass nouns, some count nouns) by inferring meaning from the syntactic structure of the sentence in which the words were introduced.
In a 2010 study, Syrett and Lidz show that children learn the meaning of novel gradable adjectives on the basis of the adverbs that modify them. Gradable adjectives have a scale associated with them: for example, the adjective “large” places the noun that it modifies on a size scale, while the adjective “expensive” places the noun that it modifies on a price scale. In addition, gradable adjectives (GAs) subdivide into two classes: relative and maximal GAs.
Relative GAs are words like “big” in (5), and require a reference point: a big mouse is not the same size as a big elephant. As shown in (6) and (7), while relative GAs can be modified by the adverb very, they cannot be modified by the adverb completely.
relative gradable adjectives
(5) a. a big mouse b. a big elephant
(6) a. a very big mouse b. a very big elephant
(7) a. *a completely big mouse b. *a completely big elephant
Maximal GAs are words like “full” in (8); they operate on a closed-ended scale. As shown in (9) and (10), while maximal GAs cannot be modified by the adverb very, they can be modified by the adverb completely.
maximal gradable adjectives
(8) a. a full pool b. a full tank
(9) a. ?? a very full pool b. ?? a very full tank
(10) a. a completely full pool b. a completely full tank
In the 2010 study, Syrett and Lidz showed children pictures of objects that could be described in terms of both relative and maximal GAs: for example, a picture of a container that could be described as both tall (a relative GA) and clear (a maximal GA).
When these objects were shown to the children, the novel adjective used to describe them was prefaced with either the adverb very (which usually modifies relative GAs) or the adverb completely (which modifies maximal GAs). As a control, in some contexts, no adverb was present. When the novel adjective was presented with the adverb very, the children assigned a relative GA meaning to it, and when it was presented with the adverb completely, a maximal GA meaning. When no adverb was present, the children were unable to assign a meaning to the adjective. This shows that, in order to learn the meaning of a new adjective, children depend on grammatical information provided by adverbs about the semantic class of the novel adjective.
Acquiring functional categories
There is a basic contrast between lexical categories (which include open-class items such as verbs, nouns, and adjectives) and functional categories (which include closed-class items such as auxiliary verbs, case markers, complementizers, conjunctions, and determiners). The acquisition of functional categories has been studied significantly less than that of lexical categories, so much remains unknown. A 1998 study led by Rushen Shi shows that, at a very young age, Mandarin and Turkish learners use phonological, acoustic, and distributional cues to distinguish words that belong to lexical categories from words that belong to functional categories. Children aged 11 to 20 months were observed speaking with their mothers to evaluate whether speech directed at the children contained clues that they could then use to categorize words as "lexical" or "functional". Compared to lexical category words, functional category words were found to have the following properties:
- simpler syllable structures
- simpler vowels (monophthongs as opposed to diphthongs)
- shorter duration
- lower amplitude
- occur much more frequently in speech
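How several weak cues might jointly point toward one category can be sketched as a simple vote count. This is a hedged illustration only: the `WordToken` fields, the comparison rule, and especially the numeric values are invented for the example and are not measurements from the study.

```python
from dataclasses import dataclass

@dataclass
class WordToken:
    form: str
    syllable_segments: int     # rough proxy for syllable complexity (assumed measure)
    has_diphthong: bool
    duration_ms: float
    amplitude_db: float
    frequency_per_1000: float  # occurrences per 1000 words of input

def functional_votes(word, reference):
    """Count how many of the five listed cues make `word` look more function-like than `reference`."""
    cues = [
        word.syllable_segments < reference.syllable_segments,    # simpler syllable structure
        (not word.has_diphthong) and reference.has_diphthong,    # simpler vowel
        word.duration_ms < reference.duration_ms,                # shorter duration
        word.amplitude_db < reference.amplitude_db,              # lower amplitude
        word.frequency_per_1000 > reference.frequency_per_1000,  # more frequent
    ]
    return sum(cues)

# Illustrative values only, chosen to show the mechanics; they are not real data.
the = WordToken("the", 2, False, 90.0, 60.0, 50.0)
tiger = WordToken("tiger", 4, True, 350.0, 68.0, 0.1)

print(functional_votes(the, tiger))   # every cue favours "the" as the function-like word
print(functional_votes(tiger, the))   # no cue favours "tiger"
```

No single cue needs to be reliable on its own; the sketch only shows how agreement among several cues could separate the two classes.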
Even before infants can comprehend word meaning, prosodic details assist them in discovering syntactic boundaries. Prosodic bootstrapping or phonological bootstrapping investigates how prosodic information — which includes stress, rhythm, intonation, pitch, pausing, as well as dialectal features — can assist a child in discovering the grammatical structure of the language that she or he is acquiring.
In general, prosody introduces features that reflect attributes of either the speaker or the utterance type. Speaker attributes include emotional state, as well as the presence of irony or sarcasm. Utterance-level attributes are used to mark questions, statements, and commands, and they can also be used to mark contrast.
- Prosodic features associated with the speaker: emotional state, irony, sarcasm
- Prosodic features associated with utterance type: question, statement, command, contrast
Similarly, in sign language, prosody includes facial expression, mouthing, and the rhythm, length, and tension of gestures and signs.
In language, words are not only categorized into phrases, clauses, and sentences. Words are also organized into prosodic envelopes. The idea of a prosodic envelope states that words that go together syntactically also form a similar intonation pattern. This explains how children discover syllable and word boundaries through prosodic cues. Overall, prosodic bootstrapping explores determining grammatical groupings in a speech stream rather than learning word meaning.
There is evidence that the acquisition of language-specific prosodic qualities starts even before an infant is born. This is seen in neonate crying patterns, which have qualities that are similar to the prosody of the language that they are acquiring. The only way that an infant could be born with this ability is if the prosodic patterns of the target language are learned in utero. Further evidence of young infants using prosodic cues is their ability to discriminate the acoustic property of pitch change by 1–2 months old.
Prosodic cues for syntactic structure
Infants and young children receive much of their language input in the form of infant-directed speech (IDS) and child-directed speech (CDS), which are characterized as having exaggerated prosody and simplification of words and grammar structure. When interacting with infants and children, adults often raise and widen their pitch, and reduce their speech rate. However, these cues vary across cultures and across languages.
There are several ways in which infant- and child-directed speech can facilitate language acquisition. Recent studies show that IDS and CDS contain prosodic information that may help infants and children distinguish between paralinguistic expressions (e.g. gasps, laughs, expressions) and informative speech. In Western cultures, mothers speak to their children using exaggerated intonation and pauses, which offer insight about syntactic groupings such as noun phrases, verb phrases, and prepositional phrases. This means that the linguistic input infants and children receive includes some prosodic bracketing around syntactically relevant chunks.
(1) Look the boy is patting the dog with his hand.
(2) *Look the boy ... is ... patting the ... dog with his ... hand.
(3) Look ... [DP The boy] ... [VP is patting the dog] ... [PP with his hand].
A sentence like (1) will not typically be produced with the pauses indicated in (2), where the pauses "interrupt" syntactic constituents. For example, pausing between the and dog would interrupt the noun phrase (DP) constituent, as would pausing between his and hand. Most often, pauses are placed so as to group the utterance into chunks that correspond to the beginnings and ends of constituents such as noun phrases (DPs), verb phrases (VPs), and prepositional phrases (PPs). As a result, sentences like (3), where the pauses correspond to syntactic constituents, are much more natural.
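The contrast between (2) and (3) can be made concrete by checking whether each pause falls at the edge of a constituent or inside one. The sketch below is a simplification under stated assumptions: it uses the flat bracketing shown in (3) rather than a full syntactic tree (so nested phrases such as "the dog" inside the VP are not checked separately), and the function name and the index-based encoding of pauses are hypothetical.

```python
def interrupted_constituents(constituents, pause_after):
    """
    constituents: list of word lists, in sentence order (flat bracketing).
    pause_after:  set of 0-based word indices that are followed by a pause.
    Returns the constituents that contain a pause strictly inside them.
    """
    broken = []
    start = 0
    for phrase in constituents:
        end = start + len(phrase) - 1  # index of the phrase's last word
        # A pause after the last word sits on the boundary, so it does not count.
        if any(start <= i < end for i in pause_after):
            broken.append(" ".join(phrase))
        start = end + 1
    return broken

# Flat bracketing taken from example (3).
sentence = [
    ["Look"],
    ["the", "boy"],
    ["is", "patting", "the", "dog"],
    ["with", "his", "hand"],
]

natural = {0, 2, 6}       # pauses after "Look", "boy", "dog", as in (3)
unnatural = {2, 3, 5, 8}  # pauses after "boy", "is", "the", "his", as in (2)

print(interrupted_constituents(sentence, natural))
print(interrupted_constituents(sentence, unnatural))
```

With the pauses of (3), no constituent is interrupted; with the pauses of (2), both the verb phrase and the prepositional phrase are.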
Moreover, within these phrases are distinct patterns of stress, which help to differentiate individual elements within the phrase, such as a noun from an article. Typically, articles and other unbound morphemes are unstressed and are relatively short in duration in contrast to the pronunciation of nouns. Furthermore, in verb phrases, auxiliary verbs are less stressed than main verbs. This can be seen in (4).
(4) They are RUNning.
Prosodic bootstrapping states that these naturally occurring intonation packages help infants and children to bracket linguistic input into syntactic groupings. Currently, there is not enough evidence to suggest that prosodic cues in IDS and CDS facilitate the acquisition of more complex syntax. However, IDS and CDS are richer linguistic inputs for infants and children.
Prosodic cues for clauses and phrases
There is continued research into whether infants use prosodic cues – in particular, pauses – when processing clauses and phrases. Clauses are the largest constituents within a sentence and are often produced in isolation in conversation; for example, <Did you walk the dog?>. Phrases, in turn, are smaller components of clauses; for example, <the tall man> or <walks his dog>. Peter W. Jusczyk argued that infants use prosody to parse speech into smaller units for analysis. He, along with colleagues, reported that 4.5-month-old infants showed a preference for artificial pauses at clause boundaries in comparison to pauses at other places in a sentence. This preference for pauses at clause boundaries illustrates infants' ability to discriminate clauses in a passage. It reveals that while infants do not understand word meaning, they are in the process of learning about their native language and its grammatical structure. In a separate study, Jusczyk reported that 9-month-old infants preferred passages with pauses occurring between subject noun phrases and verb phrases. These results are further evidence of infant sensitivity to syntactic boundaries. In a follow-up study by LouAnn Gerken et al., researchers compared sentences such as (5) and (6). The prosodic boundaries are indicated by parentheses.
(5) (Joe)(kissed the dog).
(6) (He kissed)(the dog).
In (5), there is a pause before the verb <kissed>. This is also the location of the boundary between the subject and the verb phrase. By comparison, in (6), which contains a weak pronoun, speakers either do not produce a salient prosodic boundary or place the boundary after the verb <kissed>. When tested, 9-month-old infants showed a preference for pauses located before the verb, as in (5). However, when passages with pronoun subjects were used, as in (6), infants did not show a preference for where the pause occurs. While these results again illustrate that infants are sensitive to prosodic cues in speech, they also introduce evidence that infants prefer prosodic boundaries that occur naturally in speech. Although the use of prosody in infant speech processing is generally viewed as assisting infants in speech parsing, it has not yet been established how this speech segmentation enriches the acquisition of syntax.
Critics of prosodic bootstrapping have argued that the reliability of prosodic cues has been overestimated and that prosodic boundaries don't always match up with syntactic boundaries. It is argued instead that while prosody does provide infants and children useful clues about a language, it does not explain how children learn to combine clauses, phrases, and sentences, nor word meaning. As a result, a comprehensive account of how children learn language must combine prosodic bootstrapping with other types of bootstrapping as well as more general learning mechanisms.
Pragmatic bootstrapping refers to how pragmatic cues and their use in social context assist in language acquisition, and more specifically, word learning. Pragmatic cues are conveyed both verbally and nonlinguistically. They include hand gestures, eye movement, a speaker's focus of attention, intentionality, and linguistic context. Similarly, the parsimonious model proposes that a child learns word meaning by relating language input to their immediate environment. An example of pragmatic bootstrapping would be a teacher saying the word <dog> while gesturing to a dog in the presence of a child.
Children are able to associate words with actions or objects by following the gaze of their communication partner. Often, this occurs when an adult labels an action or object while looking at it.
- Baldwin carried out experiments in which 18-month-olds were shown two novel objects, which were then concealed in separate containers. The experimenter would then peek into one of the containers and say, "There's a modi in here", and then remove both objects from the containers and give them to the child. When asked for the "modi", the child would hold up the object that the experimenter had been looking at when they labelled it. This illustrates how children use eye gaze and labelling to learn the names of novel objects.
- Tomasello and Akhtar applied a “Show Me Widget” test where a novel and nameless action was performed with a novel and nameless object. The experimenter would perform the action with the object and then pass the object to the child and instruct the child to "widget". The experimenter's behavior before they passed the child the object was manipulated between two conditions:
Action Highlighted Condition: The experimenter would prepare an object the child would use to perform a specific action by correctly orientating the object. The experimenter would then hold out the object and say, "Widget, Jason! Your turn!".
Object Highlighted Condition: The experimenter would not prepare the object for the child and would simply hold out the object to the child and say, "Widget, Jason! Your turn!".
The results from the experiment illustrated that children in the Action Highlighted Condition associated the novel word with the novel action, whereas the children in the Object Highlighted Condition assumed the novel word referred to the novel object. To understand that the novel word referred to the novel action, children had to learn from the experimenter's nonverbal behavior that they were requesting the action of the object. This illustrates how non-linguistic context influences novel word learning.
Observing adult behavior
Children also look at the adult's face when learning new words, and this can often lead to a better understanding of what the word means. In everyday speech, mistakes are often made, so why don't children end up learning the wrong words for the targeted things? This may be because children are able to see whether the word was right or wrong for the intended meaning by seeing the adult's facial expressions and behaviors.
- Tomasello and Barton performed multiple studies to see if infants could understand whether an action was intentional or accidental, and if they could learn and understand a new verb based on emotional cues.
Verb: Plunk ..."I'm going to plunk Big Bird!"
The adult said this sentence without previously explaining what the verb "plunk" would mean. Afterwards, the adult would do one of two things.
Action 1 She then performed the target action intentionally, saying "There!", followed immediately by another action on the same apparatus performed "accidentally", in an awkward fashion saying "Whoops!"
Action 2 Same as Action 1, however, reversed.
Afterwards, the children were asked to do the same with another apparatus, to see whether they would perform the targeted action.
Verb: Plunk "Can you go plunk Mickey Mouse?"
The results showed that the children were able to understand the intended action for the new word that they had just heard, and performed the action when asked. By watching the adult's behavior and facial expressions, they were able to understand what the verb "plunk" meant, and figure out whether it was the targeted action or the accidental action.
- Akhtar, Carpenter and Tomasello carried out a similar experiment that focused on the behaviors of adults and word learning, this time for nouns. In this experiment, two experimenters and a guardian of the child were inside a room, playing with 3 objects. Each object was played with for an equal amount of time, with equal excitement. Afterwards, one experimenter and the guardian would leave, and the experimenter left behind would present a new toy, and play with it with the same excitement as the other objects, for about the same length of time. The other experimenter and the guardian then came back, and the experiment proceeded in one of two ways, labelled the "Language" and "No-Language" conditions. In the Language condition, the new toy was referred to with a novel term, while in the No-Language condition, the term was not used.
Language "Look, I see a gazzer! A gazzer!"
No-Language "Look, I see a toy! A toy!"
Afterwards, the adults would leave, then ask the child to bring the new object over. In the Language condition, the child would correctly bring the targeted object over. In the No-Language condition, the child would just randomly bring an object over.
This presents the discovery of two things...
- The child was aware of which object was new for the adults that left the room.
- The child knew that the adult was excited because the object was new, and that is why they would use this new term that they had never heard before.
...and the child was able to understand this based on the emotional behaviors of the adult.
- Hohle, Barbara (2009). "Bootstrapping Mechanisms in First Language Acquisition". Linguistics 47 (2): 359–382. doi:10.1515/LING.2009.013. Retrieved 28 October 2014.
- Pinker, Steven (1984). Language Learnability & Language Development. Harvard University Press.
- Siklossy, L (1976). "Problem-solving approach to first language acquisition". Annals of the New York Academy of Science 280: 257–261. doi:10.1111/j.1749-6632.1976.tb25491.x.
- Saffran, Jenny (1996). "Word Segmentation: The Role of Distributional Cues". Journal of Memory and Language 35 (4): 606.
- Kiss, George (1973). "Grammatical word classes: a learning process and its simulation". Psychology of Learning and Motivation 7: 1–39.
- Putnam, Hilary (1985). In Cohen, Robert; Wartofsky, Marx, eds. A Portrait of Twenty-Five Years: Boston Colloquium for the Philosophy of Science 1960–1985. D. Reidel Publishing Company. pp. 41–51.
- Karmiloff-Smith, Annette; Karmiloff, Kyra (2002). Pathways to Language: From Fetus to Adolescent. USA: Harvard University Press. pp. 112–114.
- Christiansen, Morten; Monaghan, Padric (2006). Discovering Verbs Through Multiple Cue Integration. Oxford University Press.
- Behrens, Heike (2001). In Bowerman, Melissa; Levinson, Steve, eds. Language Acquisition and Conceptual Development. Cambridge: Cambridge University Press. pp. 450–474.
- Tomasello, Michael (2001). In Bowerman, Melissa; Levinson, Steve, eds. Language Acquisition and Conceptual Development. Cambridge: Cambridge University Press. pp. 132–158.
- Chierchia, Gennaro (1994). In Lust, Barbara; Suner, Margarita; Whitman, John, eds. Syntactic Theory and First Language Acquisition: A Cross-Linguistic Perspective. New Jersey: Lawrence Erlbaum Associates. pp. 301–350.
- Brown, Roger (1957). "Linguistic Determinism and the Part of Speech".
- Naigles, L. (1990). "Children Use Syntax to Learn Verb Meaning". Journal of Child Language 17: 357–374. doi:10.1017/S0305000900013817.
- Gleitman, Lila (1990). "The Structural Sources of Verb Meanings".
- Quine, W.V. (1969). Ontological Relativity and Other Essays.
- Soja, N. (1992). "Inferences about the meanings of nouns: the relationship between perception and syntax".
- Syrett, Kristen; Lidz, Jeffrey (2010). "30-Month-Olds Use the Distribution and Meaning of Adverbs to Interpret Novel Adjectives".
- Shi, Rushen; Morgan, James L.; Allopenna, Paul (1998). "Phonological and acoustic bases for earliest grammatical category assignment: a cross-linguistic perspective".
- Gleitman, Lila; Wanner, Eric (1982). Language Acquisition: The State of the Art. Cambridge, MA: Cambridge University Press.
- Cross, Ian (2009). "Communicative Development: Neonate Crying Reflects Patterns of Native-Language Speech". Current Biology 19: R1078–R1079. doi:10.1016/j.cub.2009.10.035.
- Kuhl, P.K.; Miller, J.D. (1982). "Discrimination of Auditory Target Dimensions in the Presence or Absence of Variation in a Second Dimension by Infants". Perception & Psychophysics 31: 279–292.
- Kempe, Vera; Schaeffler, Sonja; Thoresen, John (2010). "Prosodic Disambiguation in Child-Directed Speech". Journal of Memory and Language 62: 204–225. doi:10.1016/j.jml.2009.11.006.
- Soderstrom, M.; Blossom, M.; Foygel, R.; Morgan, J.L. (2008). "Acoustical Cues and Grammatical Units in Speech to Two Preverbal Infants". Journal of Child Language 35: 869–902.
- Soderstrom, Melanie; Seidl, Amanda; Kemler Nelson, Deborah G.; Jusczyk, Peter W. (2003). "The Prosodic Bootstrapping of Phrases: Evidence from Prelinguistic Infants". Journal of Memory and Language 49: 249–267. doi:10.1016/S0749-596X(03)00024-X.
- Jusczyk, P.W.; Hohne, E.; Mandel, D. (1995). "Picking Up Regularities in the Sound Structure of the Native Language". Speech Perception and Linguistic Experience: Theoretical and Methodological Issues in Cross-Language Speech Research: 91–119.
- Jusczyk, P.W.; Hirsh-Pasek, K.; Kemler Nelson, D.; Kennedy, L.; Woodward, A.; Piwoz, J. (1992). "Perception of Acoustic Correlates of Major Phrasal Units by Young Infants". Cognitive Psychology 24: 252–293.
- Gerken, L.-A.; Jusczyk, P.W.; Mandel, D.R. (1994). "When prosody fails to cue syntactic structure: Nine-month-olds' sensitivity to phonological versus syntactic phrases". Cognition 51: 237–265.
- Caza, Gregory A.; Knott, Alistair (2012). "Pragmatic Bootstrapping: A Neural Network Model of Vocabulary Acquisition". Language Learning and Development 8 (2): 113–135. doi:10.1080/15475441.2011.581144. ISSN 1547-5441.
- Baldwin, D.A. (1993). "Early Referential Understanding: Infants' Ability to Recognize Referential Acts for What They Are". Developmental Psychology 29: 832–843. doi:10.1037/0012-1649.29.5.832.
- Tomasello, Michael; Akhtar, Nameera (1995). "Two-year-olds use pragmatic cues to differentiate reference to objects and actions". Cognitive Development 10 (2): 201–224. doi:10.1016/0885-2014(95)90009-8. ISSN 0885-2014.
- Tomasello, Michael (2000). "The Social-Pragmatic Theory of Word Learning". Pragmatics: Quarterly Publication of the International Pragmatics Association 10: 59–74.
- Tomasello, Michael; Barton, Michelle E. (1994). "Learning words in nonostensive contexts". Developmental Psychology 30 (5): 639–650. doi:10.1037/0012-1649.30.5.639. ISSN 0012-1649.
- Akhtar, Nameera; Carpenter, Malinda; Tomasello, Michael (1996). "The Role of Discourse Novelty in Early Word Learning". Child Development 67 (2): 635–645. doi:10.1111/j.1467-8624.1996.tb01756.x. ISSN 0009-3920.