/
Using large online corpora for language teaching and learning Using large online corpora for language teaching and learning

Using large online corpora for language teaching and learning - PowerPoint Presentation

mindeeli
mindeeli . @mindeeli
Follow
343 views
Uploaded On 2020-06-23

Using large online corpora for language teaching and learning - PPT Presentation

Mark Davies Brigham Young University Provo Utah USA httpcorpusbyuedu GDUFS December 2014 Review of COCA Grammar must V end up V ing get V ed try and V Collocates break ID: 783814

texts sample frequency corpus sample texts corpus frequency http corpora collocates synonyms www byu biology words text html wikipedia

Share:

Link:

Embed:

Download Presentation from below link

Download The PPT/PDF document "Using large online corpora for language ..." is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

Using large online corpora for language teaching and learning

Mark Davies

Brigham Young University

Provo, Utah, USA

http://corpus.byu.edu

GDUFS / December 2014

Slide2

Review of COCA

Grammar: must V, end up V-

ing

, get V-

ed

, try and V

Collocates:

break

, brooding,

bodice, cause

, visibly

Concordance lines:

budge, diametrically

Synonyms:

strong, utter

/

sheer

Frequency:

a lot of, attitudinal, somewhat

ADJ

Slide3

Finding the right word

“potent” argument

“tough” regulations

Synonym chains:

precarious

Slide4

www.WordAndPhrase.info

Browse frequency lists (1 – 60,000)

Input and analyze texts

Does much of what COCA does, but all on one page

Frequency (by genre)

Definition

Collocates

Concordance lines

Synonyms

Slide5

www.wordAndPhrase.info

Overview

Frequency

(help pages)

Enter texts (saved: fitness, vent cap, fiction)

Academic: frequency (help pages)

Academic: enter texts (saved:

Sci

, Med)

synonyms; e.g. range ~2600:

incorporate, assign, valuable, classic, assumption, barrier

Slide6

Sample text 1

China/US climate change accord

http://corpus.byu.edu/texts/

climateChange.html

Keywords

mitigate (all)

greenhouse (concordances)

devastating (synonyms)

ire (draw as collocate)

emissions (collocates; click)

a “considerable” amount (other options)

somewhat [ADJ] (genres)

Slide7

Sample text 2

http://www.cbsnews.com/news/comet-philae-lander-survive-we-need-to-be-very-lucky

/

http://corpus.byu.edu/texts/cometLander.html

keywords

forestall, rouse, jumble, illuminated (all)

precisely, secondary (genres)

Slide8

Sample text 3

http://www.zdnet.com/android-lollipop-users-warn-of-unusable-devices-after-upgrading-7000035977

/

http://corpus.byu.edu/texts/androidDevices.html

Slide9

Sample compositions

#1

various competition

a lot of

no matter what

#4

strongly advocate

for (ok)

advocate that (?)

opportunities

* people

Slide10

Sample texts: Wikipedia

fungus

internal combustion engine

Battle of Gettysburg

Yao Ming

Slide11

The Wikipedia Corpus

Almost done: December 2014

1.9 billion words, 4.6 million articles

Can quickly and easily create personalized, virtual corpora (e.g. electrical engineering, biology, automobiles, finance, Star Trek)

Search within corpora

Compare across corpora

Create keywords

Slide12

The Wikipedia Corpus

Sample: [investment] (words)

Sample:

buddh

* (words)

Sample: biology (titles)

Sample:

audi

porsche

(titles)

Search within corpus: biology (collocates

cell

)

Compare frequency across corpora (

studies

)