Tuesday, April 22, 2014

It's turtles round and round

Part I
I've been trying to understand categories better, and one of the books I've been reading in pursuit of this goal is George Lakoff's Women, Fire, and Dangerous Things. In fact, a few nights ago, I fell asleep reading it, and it must have stirred something in my mind because the next morning in the shower, it occurred to me that perhaps categories are just a distraction and it's really properties we should be looking at.
The category of red things is just a human convenience. But red is a property. Almost immediately, though, I realized that red is a category of electromagnetic waves and that electromagnetic waves are themselves a category. And from there, well, it's turtles all the way down. I set the idea aside as I dried myself and got ready to leave home.

Part II
When I got to work, our Blackboard system was down, so I couldn't do the grading I had intended to do. Distractedly I opened the Simple English Wiktionary and saw that somebody had edited the entry for preposition pace. The change was an improvement on, what I thought was a rather odd previous definition. Going through the history, though, I noticed that the older definition was one I had provided. Curious about what I had been thinking, I went to the OED's entry for pace. There I found the following example:
1995   Computers & Humanities 29 404/1,   I do not believe, pace Peirce and Derrida, that it is signs all the way down.
This struck me as a huge coincidence. The expression shows up in the Corpus of Contemporary American English about once per 150 million words. I had encountered it, or a variation thereof, twice in a morning.

Part III
I looked up the expression and found that Wikipedia has an entry (linked above). One of the citations listed there is due to John (Haj) Ross's 1967 linguistics dissertation Constraints on Variables in Syntax. I followed the link and opened up his dissertation, which indeed contains the story with the line "It's turtles all the way down."
On the next page was the Acknowledgements, which list a number of linguists, but on page x, Ross writes,
This thesis is an integral part of a larger theory of grammar which George Lakoff and I have been collaborating on for the past several years.
This is, of course, the same Lakoff whose book I had been reading the night before.

Friday, April 11, 2014

Looking to the futurate

The verb look has been used to talk about the future for a long time. Perhaps the most common use is in the expression look forward to (something). This use may be based on the metaphor that time is a landscape we move through. As such, our future should be visible to us. This is probably the same metaphor that underlies the use of go for the future in expressions like we're going to get to that in a moment.

Despite its venerable history, futurate look began a significant upsurge in about 1980, particularly, in the looking + to infinitive construction.
I noticed that this seems to be particularly common with are. Especially, you are. And even more specifically if you are. By this time we are looking at a small minority of the cases. But I wondered if it might tell us something about the meaning of the looking futurate as opposed to the going futurate. When I started looking at various corpus genres, though things got a little to complex. Maybe you have some thought to add.

Tuesday, April 08, 2014

The opacity of etymology

The word disseminate is a familiar one. It appears hundreds of times on my hard drive and in well over 20 email messages I've read or written in the last five years. But until today, I had never seen the seeds in the word.

We often use a plant metaphor to talk about words. Morphology is a branch of linguistics just as plant morphology is a branch of biology. Both sciences talk of roots and stems, but in linguistics, seeds aren't part of the metaphor.

The root of disseminate though is semin or semen, from the Latin word meaning seed. Dissemination is the spreading of seeds. Semin is also the root of the word seminary, literally a seed plot, but now metaphorically used to mean a place to train priests. This is also where seminar comes from. I hadn't connected up these words either.

Disseminate appears in a passage that my level-8 class is studying. It's a passage that I've been over many times with other classes, and I have had to explain the word before. But never before have I made this connection.

How could this be?

On the flip side of this are cases of people seeing connections where there are none. Consider the regular flare ups about the word niggardly based on a mistaken perception that it's based on a racial slur. There are so many opportunities for false positives. The string semen, for instance shows up in a variety of words such as basement and horsemen, and nobody would ever think it referred to seeds.

How does this noticing thing work?

Friday, September 06, 2013

Friday, June 28, 2013

New issue of Contact available

The newest issue of TESL Ontario's Contact magazine is now available. This is the research symposium issue, based on talks given each fall at TESL Ontario's conference. The issue is edited by Hedy McGarrell and David Wood. The authors are: Dianne Larsen-Freeman, Antonella Valeo, Farahnaz Faez, Douglas Fleming, Alister Cumming, Robert Kohls, Hedy McGarrell, David Wood, and Randy Appel.

Please, check it out.

Monday, May 27, 2013

Then: the meaning changes

A student wrote to me asking about the meaning and placement of then in this sentence.
The product is then designed with this target price in mind.
As I was explaining that then could be resultative (meaning something like "so that" or "in that case") or simply temporal (meaning roughly "next" or "after that"). I also pointed out that it could occur initially, medially (in a number of places), or finally:
1 The product is 2 designed with this target price in mind 3.
Then, it occurred to me that position 1 could only be interpreted as being the temporal meaning, position 2 was ambiguous, and position 3 strongly suggests the resultative meaning.

Is it just me?

Tuesday, March 05, 2013

New issues of Contact available

I'm now into my second year as editor of TESL Ontario's Contact magazine. You can download issues for free as far back as 2000. I've been the editor for just over a year now (volumes 38 & 39). We just put out the conference issue today, as we do every spring. I hope you enjoy it. Please, leave me a comment to let me know what you thought.

Monday, October 29, 2012

Education First's English Proficiency Index

Last year, I blogged about the first English Proficiency Index here. Education First has now released their second index, and they're getting some good publicity out of it (I'm linking to them). The same problems as last year occur: self selection of participants, and need for a computer with internet access to do the test. There's also no data on test reliability and no validity arguments. Still, with 1.7 million participants over three years in more than 50 countries, you wouldn't want to completely dismiss the results either.

Thursday, October 25, 2012

ER-Central.com


Another new and potentially useful site:
Extensive Reading Central is a not-for-profit organization dedicated to developing an Extensive Reading and Extensive Listening approach to foreign and second language learning. It was started by Dr. Rob Waring of Note Dame Seishin University, Okayama, Japan and Dr. Charles Browne of Meiji Gakuin University, Tokyo, Japan as a free service to the EFL community.

Tutela

Tutela has been in beta for about a year but had its official launch during the TESL Canada conference. From their terms of use:
Tutela.ca (“Service”) is a Canadian not-for-profit online repository and community for ESL (English as a Second Language) and FSL (French as a Second Language) professionals registered to use the Service (“Users”). As a repository, the Service provides Users with access to ESL and FSL materials including classroom materials, lesson plans, assessment information, and reusable learning objects. As a community, the Service enables Users to share materials, discover new approaches, locate solutions, and network including through the use of the online meeting and webinar conferencing capabilities of the Service (“Conferencing”). The Service is supported by funding from Citizenship and Immigration Canada (“CIC”) and is owned and operated by Citadel Rock Online Communities Inc. (“Citadel”). (my bold)
I'm not really sure how this works. Tutela itself is not for profit, but Citadel is a privately owned corporation, for profit as far as I can tell. It has received a number of grants from the federal government. Just so as you know...

Saturday, October 20, 2012

More Google Ngrams 2.0 POS tagging


As I wrote yesterday, there are some strange tagging decisions concerning determinatives in this corpus. It seems though, as I added in an update, that these are largely the fault of the Part-of_Speech Tagging guidelines for the Penn Treebank Project.

The problems, though, are not limited to determinatives. Subordinators are also affected. The words that, whether, and if (e.g., They told me that it was OK and They asked me whether/if it was OK) are tagged as _ADP_ (short for ADPOSITION, a more inclusive term for prepositions.) The guidelines say:
"We make no explicit distinction between prepositions and subordinating conjunctions. (The distinction is not lost, however - a preposition is an IN that precedes a noun phrase or a prepositional phrase, and a subordinate conjunction is an IN that precedes a clause).
The preposition "to" has its own special tag TO."
This makes good sense for words like because, after, and since, which have been treated as "subordinating conjunctions" but really are prepositions and function as the heads of preposition phrases. It doesn't work, though, with that, whether, and if, which function as markers of subordination and not heads. Consider the difference between these two clauses:

Language Learner Literature Award Winners

Somehow I missed the announcement, but the 2012 LLL Award Winners for books published in 2011 have been announced.

Friday, October 19, 2012

Google Ngrams 2.0 and POS tagging

As Ben Zimmer blogged yesterday, there's a new and improved version of the Google Ngram viewer. The "improved" bit has a number of elements, but one is POS tagging. This is a wonderful thing, and I'm inordinately happy about it. Unfortunately, there are some very odd quirks to deal with.

Thursday, October 18, 2012

Impacting (on) complement choice

After 20 years of teaching English, I've only recently become consciously aware that my students may be applying the complementation pattern of one word to its derivationally related partner. That is, they regularly say and write things like *it influenced on the situation using a prepositional phrase complement headed by on where the verb should really just take a direct object. Notice, though, that the noun influence licenses the on PP complement that the students are using (e.g., it had no influence on the outcome).

Thursday, September 20, 2012

An adverb caught modifying a noun

This morning, CBC Toronto's Metro Morning program was being broadcast remotely from the new arts and culture hub of Regent Park. Shortly before 6:00, I believe, the host, Matt Galloway, said something along the lines of "We're here at the opening officially of the new arts and culture hub." I'm going from memory, but I'm certain about the opening officially of....

What we have here is an adverb officially post modifying a noun opening. Most grammars will tell you adverbs aren't supposed to do that. It even sounds vaguely obscene. But I tell you, it's the truth.

Now, obviously, opening is deverbal, but you could say the official opening of... with the expected modification by an adjective, and you couldn't say *the officially opening of... with the adverb before the noun. More importantly, an example like *the running quickly of the race seems highly unlikely, so being deverbal isn't the whole deal.

You may think that officially must be modifying something else in the sentence, but since this is between the noun and the modifying of phrase, it's clearly internal to the noun phrase and the most obvious thing for it to be modifying in there is the head noun. When they post the podcast tomorrow, I'll try to find the exact sentence. I think you'll find that there's really nothing else in the sentence that officially could possibly have been modifying.

[Update: the podcast is now available, but it's only 25 minutes of highlights, and apparently interesting grammatical features don't qualify. A google search, though, turns up a number of relevant instances elsewhere.]

Friday, June 08, 2012

"Common" idioms


In the new issue of TESOL Journal is an article by Fatima Alali and Norbert Schmitt that discusses teaching idioms. "Because many idioms occur relatively rarely, each idiom’s frequency was checked in the British National Corpus to make sure it was relatively common" (p. 158). I've listed below the first 10 idioms used along with their frequencies in the BNC and COCA. For each corpus, I present the frequency per million words in the genre where the individual idiom appears most frequently. Sometimes this was spoken, but it varied a lot from idiom to idiom. With verb-based idioms, the frequency includes all forms of the verb.

Idiom
COCA
BNC
[pay] the piper
0.29
0.69
[make] no bones about it
0.18
0.30
[mend] fences
0.44
0.57
life and limb
0.34
1.15
ivory tower
0.87
1.45
down in the dumps
0.25
0.48
[dwell] on the past
0.16
0.19
[cast] a long shadow
0.24
0.29
run of the mill
0.22
0.48
off the hook
4.00
3.39
Note. All frequencies are per million words.

Do these strike you as "common"?

The paper concludes, "formulaic language is an important component of discourse," a point with which I take no issue, but this is not the same as saying that particular formulas are important. The study makes a number of good points, but the focus, the teaching of idioms, is one that needs a rethink, not so much in how its done but in why it is done at all.

Past posts on idioms here.

Tuesday, April 10, 2012

Mark Davies' new academic word lists

Mark Davies, over at Brigham Young University, has developed some fantastic corpus-based resources. His most recent project, along with Dee Gardner, is a set of academic word lists (notice the plural).

In the comparison they provide with Coxhead's 2000 AWL, they claim,
our word lists provide better coverage of academic English. The 570 "word families" in the AWL cover 7.2% of the words in the COCA academic texts, but the top 570 word families in our list cover 14.0% -- nearly twice as much. In a "neutral" corpus -- the 32 million words of academic and semi-academic texts in the British National Corpus -- the AWL covers 7.1% and our list covers 14.0% -- again nearly twice as much.
I haven't had an opportunity to look at these carefully, but at first glance, this seems like a very unfair comparison. It seems that part of the way they have achieved the very high coverage rate is to include some very frequent words, words like between, low, need, difference, use. 

Coxhead's list is built on top of the West's General Service List, which is to say that it only includes words not already listed in the roughly 2,000 words of the GSL. As a result, more very frequent words are excluded (although others, such as area, which are very common but were excluded from the GSL because they had significant semantic overlap with another word, give the AWL an undeserved coverage boost). The approach taken by Davies and Gardner doesn't have any frequency ceiling at all. Rather, they have chosen to consider any word that occurs at least 1.5 times more frequently in the academic sub corpus of the COCA than in the other sections of the corpus. This captures words that have something of an academic proclivity, but a number of those words are vehemently everyday vocabulary.

On the other hand, the new lists distinguish between lexical categories. That is to say that noun use is considered academic, while the verb use is not. Similarly, it brings word families together while still distinguishing between different members. Thus, under the headword move, neither the noun nor the verb move are academic, but movement is.

This is quite a different approach, and will take some time to evaluate. I'm looking forward to trying though.

PS, the lists have been published minus every fifth word as a sort of embargo until a paper describing the lists can be published.

Sunday, April 01, 2012

Grammatically speaking and number

I have often been critical of the "Grammatically Speaking" columns put out occasionally by TESOL, but the most recent is, I think clear, accurate, and interesting. It looks at the the question of why we say zero degrees instead of zero degree. Something I've brought up here before.

Schmitt points out that the terms plural and singular may be misleading, and suggests singular and nonsingular. This I think, is a useful approach.

As for his brain teaser, these usually strike me as fairly obvious, but this time, I have no idea what the issue is. Any insight?

Look at the two example sentences below. Explain what grammatical holdover they illustrate and suggest how a teacher might teach this to a language class.
  1. Here's why working at home is both a curse and a blessing.
  2. In particular, Biden cited the billions of dollars in government financial support for U.S. automakers during the recession as an example of the differing approaches between the parties.

Effectiveness of LINC programs

It's rare to find a study looking at overall program effectiveness. But Citizenship and Immigration Canada has conducted just such a study of their Language Instruction for Newcomers to Canada (LINC) program, and the results are interesting. They look at a variety of elements including costs, intake type, and provision of daycare. What interested me most, though, was the language learning outcomes, as displayed in the graph below.

Friday, March 23, 2012

2012 Language Learner Literature Awards Finalists

The Extensive Reading Foundation today announced the finalists for the 2012 Language Learner Literature awards. They are reproduced below the jump.

Some of these look well worth adding to the library. In particular, I'd like to check out Arman's Journey by Phillip Prowse and Solo Saxophone by Jeremy Harmer. Why we need yet another graded reader version of Call of the Wild, A Christmas Carol, and The Great Gatsby, though, I'm not sure. Oxford, Penguin, and Heinle all have versions of Call of the Wild, for example, and I'm sure there are other publishers with their own simplifications.