tag:blogger.com,1999:blog-31830497.post7000954589511588932..comments2024-02-28T05:25:12.859-05:00Comments on English, Jack: `I' vs `the'Bretthttp://www.blogger.com/profile/02870575277556244419noreply@blogger.comBlogger6125tag:blogger.com,1999:blog-31830497.post-68402880175792636652011-11-08T05:43:56.656-05:002011-11-08T05:43:56.656-05:00Yes, it does seem that he's got it backwards.Yes, it does seem that he's got it backwards.Bretthttps://www.blogger.com/profile/02870575277556244419noreply@blogger.comtag:blogger.com,1999:blog-31830497.post-47649729060373339692011-11-08T01:15:40.340-05:002011-11-08T01:15:40.340-05:00Mark Brysbaert's table reads as an "I&quo...Mark Brysbaert's table reads as an "I"/"the" ratio to me.Michael Vnuknoreply@blogger.comtag:blogger.com,1999:blog-31830497.post-65638358457991377572011-09-22T12:06:45.848-05:002011-09-22T12:06:45.848-05:00"Would the spoken corpora transcribe /dʌ/ or ..."Would the spoken corpora transcribe /dʌ/ or /nʌ/ as "the"? "<br /><br />I would really depend on the corpora and its purpose. The BNC includes articles: de, ze, t', ta, na, th', & nu, but these make up only a few hundred words.Bretthttps://www.blogger.com/profile/02870575277556244419noreply@blogger.comtag:blogger.com,1999:blog-31830497.post-5631885801147157182011-09-22T09:17:13.374-05:002011-09-22T09:17:13.374-05:00One has to wonder about the transcriptions of the ...One has to wonder about the transcriptions of the spoken corpora. Google ngrams has "the" hovering around 5% in all subsets and "I" indistinguishable from 0%. Would the spoken corpora transcribe /dʌ/ or /nʌ/ as "the"?Faldonehttps://www.blogger.com/profile/12873736640907864834noreply@blogger.comtag:blogger.com,1999:blog-31830497.post-16244340525919133332011-09-20T11:32:00.940-05:002011-09-20T11:32:00.940-05:00It's a possibility, but one hopes that he has ...It's a possibility, but one hopes that he has addressed this. Indeed the results from very social spoken corpora match his results.Bretthttps://www.blogger.com/profile/02870575277556244419noreply@blogger.comtag:blogger.com,1999:blog-31830497.post-54666663986675103222011-09-20T11:22:50.489-05:002011-09-20T11:22:50.489-05:00My first guess would be that he is looking for the...My first guess would be that he is looking for the strings "i" and "the" with no attempt to eliminate the occurrences of those string embedded in words. In that sentence there are seven occurrences of "i"<br /> and three of "the". I would have gotten another "the" if there had been the word "there" in there somewhere.Faldonehttps://www.blogger.com/profile/12873736640907864834noreply@blogger.com