                  1. Existing resources on DDL / Graz 2000

  

Online Resources / CONCORDANCERS

Online access to corpora (online concordancing) 

·  Simple searches on the British National Corpus (BNC) at Oxford: You can search for a single word or a phrase, for example Dogged or brown bread, but also use tags & wildcards to search for parts of speech etc.

·  COBUILD Direct Corpus Sampler: The Cobuild Direct corpus is composed of 50 million words of contemporary written and spoken text. To get a flavour of the type of linguistic data that a corpus like this can provide, you can type in some simple queries here and get a display of concordance lines from the corpus. The query syntax allows you to specify word combinations, wildcards, part-of-speech tags, and so on.

·  Linguistic Data Consortium (University of Pennsylvania): Access to North American corpora through LDC Online, a free service for current-year LDC members. An interactive tutorial is available to everyone, as is a guest account permitting access to the Brown text corpus, the TIMIT speech corpus and the Switchboard corpus.

Online texts 

·         Bibliomania - a great starting point with a huge reference section, over 60 classic novels on-line, and important classic non-fiction works including Biography, Science, Economics and Ancient Texts, plus poetry, plus Shakespeare ~ (In)complete Works, plus ...

·         The Book Stack - an index to on-line books

·         The On-Line Books Page - this index includes more than 9000 English works in various formats. All should be free for personal, noncommercial use.

 

·         A few special exhibits have been prepared in collaboration with the On-Line Books Page:

  • Project Gutenberg - The Project Gutenberg Philosophy is to make information, books and other materials available to the general public in forms a vast majority of the computers, programs and people can easily read, use, quote, and search. The Project Gutenberg Library has three portions, which can basically be described as: 1) Light Literature, such as Alice in Wonderland, Through the Looking-Glass, Peter Pan, Aesop's Fables, etc.; 2) Heavy Literature, such as the Bible or other religious documents, Shakespeare, Moby Dick, Paradise Lost, etc.; 3) References, such as Roget's Thesaurus, almanacs, and a set of encyclopedias, dictionaries, etc.

 

Tim Johns Data-driven Learning Page



  • You may be interested in a bibliography of work in this area.
  • I have been putting together a Virtual DDL Library containing samples of concordance-based teaching and learning materials. I'd be interested to hear from other practitioners who might be prepared to submit examples of their work for inclusion in the Library (with full acknowledgment, of course).
  • Here are samples of DDL materials produced by participants in a workshop in Usti nad Labem (North Bohemia) 21st-25th March 2000.
  • Here is a description of the work undertaken at Birmingham under an EU-funded Lingua-Socrates project on the development of a Windows-based MultiLingual Parallel Concordancer for Classroom Use. See also a review of the program for CTI, Hull. The program is available from CFL Software Development. Texts for use with the program may be downloaded from the Parallel Texts Library as they become available.

A demonstration version of Multiconcord is now available free of charge from this site. It has all the features of the full version, except that it will work only with the three short texts in English, French and German supplied. The program, together with documentation and texts, is available in a compressed self-extracting file, MULTDEM.EXE (278 k).

  • Oxford University Press carry information on the availability of the downloadable (and upgradable) shareware version of Wordsmith Tools (Version 2.0), Mike Scott's brilliant Windows program which produces concordances, word lists, and key word lists.
  • OUP have withdrawn Microconcord from their catalogue. Mike Scott and I are hoping that we can continue to make it available, possibly over the Web: watch this space!
  • For examples of concordance data being used in one-to-one teaching and learning, see Kibbitzer on my EAP Page; and for a (downloadable) CALL program based on concordance output, see Contexts on my CALL page.

Birmingham University English Language Research Journal 4 Classroom Concordancing (eds. Johns & King) is still available: £5.50 post paid from English Language Research, University of Birmingham, Birmingham B15 2TT, United Kingdom.

All of us concerned with corpus linguistics at Birmingham University - and many further afield - were saddened by the death of Eugene Winter at the age of 73 on 20th December 1996. An Honorary Research Fellow of the School of English since his retirement from Hatfield Polytechnic, Eugene was one of the few totally original minds in British linguistics and continued to generate ideas and enthusiasm to the end: for a scholarly assessment by Michael Hoey of his remarkable contribution, click here. On a personal note, I shall always be grateful to Eugene's memory for the interest and encouragement he showed in the development of the work reported on this page. I miss him.




Last updated 1st August 2000

Chris Tribble (1997) has published an interesting manual on Using Concordances in the Classroom /literature on DDL

 

Further reference and links to relevant websites.

 

For further information, we recommend the following sites on the Internet:

Joseph Rezeau is a teacher who uses computers in language learning. His website is very useful:
Go directly to his examples for concordances.
Look at Joseph's sample exercise on is feeling vs. feels:
The father of data-driven learning, Tim Johns, has lots of useful information on his website:
Here are a few websites with access to on-line texts and corpora:
COBUILD Bank of English:
The British National Corpus:
Where to buy the software? Go to CAMSOFT or to WIDA SOFTWARE!!!

 

Software / to be bought from

The following addresses might become useful when buying software:

 

 

GOOGLE: the Search Engine we found most useful

 

 

 

 

 

 

 

RESOURCES ON TOOLS

A classic example of an authoring tool for language learning is the WIDA Software Authoring Suite.

 

 

Programming tools such as TOOLBOOK or MACROMEDIA DIRECTOR.

 

 

… a more recent example with additional and more flexible options to put together complete and structured multimedia enhanced learning packages is the TELOS package.

 

 

 

 The Half-baked Potatoes package, another example which is downloadable free of charge off the internet (...), allows teachers to create an interactive learning environment for self-study and place it on the web. 



Online Resources (tools)

Hot Potatoes (halfbaked potatoes)

As far as the use of authoring tools is concerned, the group dealing with data-driven learning at Workshop 8/2000 decided to start with the freeware Hot Potatoes suite. This tool was chosen because it is not only easy to use but also available free of charge to educational institutions. You can visit the halfbaked potatoes website at the University of Victoria (Canada) and download the software from there. All you then have to do is register and obtain a registration key. Hot Potatoes remains free of charge for non-profit educational users who make their pages available on the web.

An important aspect is that the team behind the authoring suite keeps adding to the authoring tools. A new application under development by the Hot Potatoes team is Quandary, an authoring tool for creating Web-based action mazes. The multimedia potential of Hot Potatoes is also constantly being expanded.

 

  • Halfbaked Potatoes. The freeware Hot Potatoes suite includes six applications, enabling you to create interactive multiple-choice, short-answer, jumbled-sentence, crossword, matching/ordering and gap-fill exercises for the World Wide Web.
  • Sample exercises: an important service provided by the Hot Potatoes team is a list of links to Sites & Exercises built with Hot Potatoes around the world. This list is regularly updated and added to.
  • The English Zone: an interesting site which can serve as a sample for others intending to create learning materials and sites for language learning with Hot Potatoes.

Other Authoring Tools for CALL Software are listed on the following Website:

·         AUTHORING TOOLS FOR CALL COURSEWARE: There are several commercially available Web-development tools that are not specifically designed to create language lessons. Since none of them contains any predefined templates, authors have to develop all lesson components using tools that were not designed for this task.

 

 

 

 

 

 

 

 

 

 

 

 



 

 

Software

The following addresses might become useful when buying software:

 

  • CAMSOFT: This company produces and retails CALL software and educational software by mail order and offers consultancy and training services in the area of language learning and teaching technology.
  • Wida Software is a team of former language teachers and computer designers. They develop and distribute language learning programs throughout the world. Programs running under Windows and on the Macintosh are available by mail order for English as a foreign language, and for French, German, Spanish, Italian and Russian. Wida specialises in making authoring programs that enable teachers to build up libraries of exercises closely geared to the needs of their own students. They provide a training service in London or wherever the customer wants it.
  • Both Wida Software and Camsoft offer the MonoConc Concordancing Software at a price of £55 (1-5 users).

 

 

 

 

 

 

  2. Resources available in Essen Workshop

 

 

Online access to corpora (samples)

·         Simple searches on the British National Corpus (BNC) at Oxford: You can search for a single word or a phrase, for example Dogged or brown bread, but also use tags & wildcards to search for parts of speech etc.

·         COBUILD Direct Corpus Sampler: The Cobuild Direct corpus is composed of 50 million words of contemporary written and spoken text. To get a flavour of the type of linguistic data that a corpus like this can provide, you can type in some simple queries here and get a display of concordance lines from the corpus. The query syntax allows you to specify word combinations, wildcards, part-of-speech tags, and so on.

·         PROJECT GUTENBERG: The Project Gutenberg Philosophy is to make information, books and other materials available to the general public in forms a vast majority of the computers, programs and people can easily read, use, quote, and search. 

 

 

 

 

 

 

 

 

 

 Concordancing Software

·         Concordancer: MonoConc 1.5: 

MonoConc 1.5 is Athelstan's original Windows concordance program. This is a good choice for concordance/computer novices who do not plan to perform many text searches. (The upgrade path to MonoConc Pro 2.0 is only $45.)

The program is shipped with both 16-bit and 32-bit versions on the disk. In other words, it will run on any Windows platform, including Windows 3.1. The basic features of the concordancer are listed here. A brief article on the use of a concordancer in language teaching (http://www.athel.com/teach3.html) may be of interest.

 

·         Concordancer for Windows. Version 3.0

Concordancer for Windows enables one to search for words in text files and display them in the form of concordances. Concordances can be used for text analysis including the investigation of style, grammar usage, vocabulary usage and teaching.

The program has been designed so as to be very easy to use for those who have had little or no previous experience with computers. The program was developed by Zdenek Martinek from the University of West Bohemia, Pilsen, Czech Republic, in close collaboration with Les Siegrist, Technische Hochschule Darmstadt, Germany.

Hardware and software requirements: PC Pentium / 133 MHz or higher, at least 4 MB RAM, VGA, Windows 95. The latest version of the program is Freeware.

Other materials

·         Linguistic Data Consortium (University of Pennsylvania): Access to North American corpora through LDC Online, a free service for current-year LDC members. An interactive tutorial is available to everyone, as is a guest account permitting access to the Brown text corpus, the TIMIT speech corpus and the Switchboard corpus.

·         Concordances of Great Books: Concordances of Great Books offers concordances of more than 300 world-famous books, from Aeschylus to Emile Zola, including many British and American classics. The web site provides a global word occurrence search and phrase search as well as an unabridged dictionary.

·         ICAMET - Innsbruck Computer Archive of Machine-Readable English Texts: The Innsbruck Computer Archive of Machine-Readable English Texts is divided into three subsections, namely the Prose Corpus 1100-1500 (a full-text database), the Letter Corpus 1386-1688 (containing 254 complete letters from different sources, arranged diachronically), and the Prose Varia Corpus (a mixture of tagged, normalized, translated and otherwise manipulated or synopsized texts). Since the texts are offered in their complete versions, they allow literary, historical and topical analyses of various kinds, particularly studies of cultural history.



 

 

 

 3.  More Internet Search on Resources on DDL from MSN / YC

 

1.    University of Stirling MSc in TESOL: Data-Driven Learning

University of Stirling Centre for English Language Teaching ML24: Data-Driven Learning Welcome to the ML24 home page! This page is intended as a supplementary resource for members of the class. I'll also be using this web page to make available a
www.celt.stir.ac.uk/resources/ML24

 

 

16.    A REVIEW OF THE TYPES OF CALL PROGRAMS FOR

A REVIEW OF THE TYPES OF CALL PROGRAMS FOR VOCABULARY INSTRUCTION Robin Goodfellow Institute of Educational Technology Open University For the International Journal of Computer Assisted Language Learning, special issue on Cognitive Aspects of Second
www-iet.open.ac.uk/pp/r.goodfellow/CALL95/call95.html

 

 

16.    E Warren

Concordance Use in Self Access Grammar Materials Abstract In 1996 the CALLCO Grammar Project was set up by the English for International Students Unit of the University of Birmingham to make self-access computer material for the instruction of
users.ox.ac.uk/~talc98/ewarren.htm

 

 

31.    Lingua Multilingual Concordancer

Multiconcord: the Lingua Multilingual Parallel Concordancer for Windows Introduction This page describes the work undertaken at the University of Birmingham under Lingua project n° 93-09/1245/F-VB (Co-ordinator Francine Roussel, Universite de
artsweb.bham.ac.uk/PKing/lingua.htm

 

 

 

Articles in books

 

Johns, T. (1991) Data-driven learning and the revival of grammar. In: Savolainen, H. and Telenius, J. (eds.), EUROCALL 91: Proceedings. Helsinki: Helsinki School of Economics, 12-22.

 

 

 

106.                 Glossary

ICT4LT: English Glossary This Glossary contains a list of technical terminology and terms specific to ICT and language learning and teaching. For a comprehensive glossary of Web terminology see Matisse Enzer's website: http://www.matisse.net/files/g

www.ict4lt.org/en/en_glossary.htm

 

 

 

Data-Driven Learning (DDL): A concept invented by Tim Johns, University of Birmingham, UK. Closely allied to the use of concordance programs in language learning and teaching, whereby authentic data is held to provide the essential information that learners need. See Tim Johns's bibliography on Classroom Concordancing and DDL at: http://web.bham.ac.uk/johnstf/biblio.htm. See Concordance Program

 

Concordance Program: A concordance program (also known as a Concordancer) operates on a body of texts (a corpus) and is commonly used for compiling glossaries and dictionaries, e.g. by arranging every word in the text alphabetically or in order of frequency, together with its context. Concordance programs also play an important role in language learning and teaching, for example: (i) the teacher can use a concordance program to find examples of authentic usage to demonstrate a point of grammar, typical collocations, etc; (ii) the teacher can generate exercises based on examples drawn from a variety of corpora; (iii) language learners can work out rules of grammar and usage for themselves by searching for a particular key word in context (KWIC). Concordance programs are allied closely to Tim Johns's concept of Data-Driven Learning (DDL). See Tim Johns's bibliography on Classroom Concordancing and DDL at: http://web.bham.ac.uk/johnstf/biblio.htm. See Module 2.4 and Module 3.4. See also Data-Driven Learning.

Concordancer: See Concordance Program.

Authoring Package / Authoring Program: An Application which allows the author to develop learning and teaching materials with significantly less programming than if a programming language were used, or with no programming at all. See Authoring Tool, Programming Language. See Module 2.5 on CALL authoring programs.

Authoring Tool: An Application which allows the author to develop learning and teaching materials with significantly less programming than if a programming language were used, or with no programming at all. See Authoring Package, Authoring Program, Programming Language. See Module 2.5 on CALL authoring programs.
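
The key-word-in-context (KWIC) display and the frequency-ordered word list described under Concordance Program can be illustrated with a minimal Python sketch. It is not taken from any of the programs listed on this page and does not claim to show how they work internally. The function names (kwic, word_frequencies), the sample sentences and the 30-character context width are all assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens (letters and apostrophes only)."""
    return re.findall(r"[a-z']+", text.lower())


def kwic(text, keyword, width=30):
    """Return one concordance line per occurrence of `keyword`.

    Each line shows up to `width` characters of context on either side,
    with the key word aligned in a single column (hypothetical layout).
    """
    flat = " ".join(text.split())  # collapse line breaks and repeated spaces
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    lines = []
    for match in pattern.finditer(flat):
        left = flat[max(0, match.start() - width):match.start()]
        right = flat[match.end():match.end() + width]
        # Right-align the left context so every key word starts in the same column.
        lines.append(f"{left:>{width}} [{match.group(0)}] {right}")
    return lines


def word_frequencies(text):
    """Return (word, count) pairs ordered from most to least frequent."""
    return Counter(tokenize(text)).most_common()


# Illustrative sample text only; a real corpus would be read from text files.
sample = (
    "She feels tired today. He is feeling better than yesterday. "
    "Nobody feels like working when the sun is out."
)

for line in kwic(sample, "feels"):
    print(line)
print(word_frequencies(sample)[:3])
```

Whole-word matching with a regular expression keeps "feels" separate from "feeling", which is the kind of contrast a learner would want to inspect line by line. A real concordancer would add sorting by left or right context and tag or wildcard queries, which this sketch leaves out.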

 

 

121.Christopher Tribble, SLALS Associate

Christopher Tribble Visiting Research Fellow Also: Associate Lecturer, King's College, London Outline of work done at SLALS: Chris has performed overseas consultancy work (e.g. revision of Writing Curriculum at Jagellonian University, Poland), and
www.rdg.ac.uk/AcaDepts/cl/slals/tribble.htm

 

 

136.WJMLL 4-5/99-00

The Relevance of Corpora to German Studies by Bill Dodd University of Birmingham E-mail: w.j.dodd@bham.ac.uk Abstract (received October 1999) Copyright Notice: First published in Web Journal of Modern Language Linguistics. © 1999 Bill Dodd. The
wjmll.ncl.ac.uk/issue04-05/dodd.htm

151.Corpora and Concordancing

Corpora and Concordancing Tutorials Corpora and Concordancers Links Journals / Articles / Bibliographies Language Tools Terminological databases / Dictionaries These links contain information and resources which will be of great use to you in your
www.ul.ie/~lcs/francais/CorporaandConcordancing.htm

 

151.Education | New tools for new texts

Materials and methodology New tools for new texts Christopher Tribble explains how software that dissects corpus texts opens up new possibilities for creating materials. See the associated activities Wednesday February 23, 2000 The Guardian For most
www.educationunlimited.co.uk/Print/0%2C3858%2C3966711%2C00.html


 

 

 

Mike SCOTT   Mike.Scott@liv.ac.uk


Applied English Language Studies Unit, University of Liverpool, Liverpool L69 3BX, United Kingdom. Email: MS2928@LIVERPOOL.AC.UK
Research Interests:
1. Developing lexical analysis tools for the micro-computer. These include concordancers, collocation analysis tools, word-listing functions, key-word and key-phrase identifiers, and dispersion plotters. Such tools are used by language learners and Applied Linguists working in Translation, Genre Analysis, etc. but can be useful to those in Literature, Political Science, etc. The process of analysis involves what Tim Johns has called "Data-driven Learning".
2. Critical Reading
3. Lexical Inferencing (PhD thesis, University of Lancaster, 1990)

 

151.The Virtual CALL Library: Multilingual

Virtual CALL Library: Multilingual A selection of software that can be used for more than one language is listed on this page. A great deal of other programs can be used for language education too - flashcards (see reviews), crosswords, adventure
www.sussex.ac.uk/langc/callmult.html

 

 

 



 

166.JTAP Case Study: The Web Concordances

Concordance software. Makes concordances and wordlists from electronic text. Can turn your concordance into HTML ready for the web with a single click.
www.dundee.ac.uk/english/wics/casemain.htm

 

181.Manuel Barbera, Corpus based computational linguistic resources. General: Tools (§ 2.5).

Corpus Linguistics Resources Guide, General pages. Tools: taggers, concordancers, parsers, corpus query and maintenance systems, and corpus oriented NLP software
www.bmanuel.org/clr2_tt.html

182.Manuel Barbera, Corpus based computational linguistic resources. General: References (§ 2.4).

Corpus Linguistics Resources Guide, General pages. References, Standards & Educationals: Corpora and Corpus Linguistics; References & Standards; Scripting Languages; Educational Resources
www.bmanuel.org/clr2_rs.html

 

 

 

196.Concordance Help

This HTML file contains all the topics from the help system of Concordance 2.0.0 Contents at a glance Click here for Full Contents Using this help system About Concordance What's new in this version Licence Terms and Conditions Obtaining and
www.concordance.ukgateway.net/Concordance200Manual.htm

 

 

 

Graphic Data Online 

FINANCE TODAY - Graphic Data Online
www.sovereign-publications.com/graphicdataonline.htm


 

 

 

241.Acquiring Plausible Unification-Based Grammars using Model-Based and Data-Driven Learning (ResearchIndex)

Undergeneration is a problem that undermines successful parsing of unrestricted texts. A popular solution to this problem is automatic grammar correction or machine learning of grammar Broadly speaking, grammar correction approaches can be classifie
citeseer.nj.nec.com/osborne95acquiring.html

 

 

 

 

 

 

 

 

 

4.  More Resources / GOOGLE / YC

 

Tim Johns Classroom Concordancing Bibliography
Classroom Concordancing/Data-driven learning Bibliography. ... Hadley, Gregory, forthcoming,
'Sensing the Winds of Change: An Introduction to Data-driven Learning'. ...
web.bham.ac.uk/johnstf/biblio.htm

 

 

 Botley and Uzar
Higher quality data-driven learning through the testing of definite
and indefinite articles. Abstract. The Universities of Lancaster ...
users.ox.ac.uk/~talc98/botley.htm

 

Research
... 931-935, Aalborg, Danmark, September 3-7, 2001 abstract | .ps | ..pdf; Megyesi,
B. 2001. Comparing Data-Driven Learning Algorithms for PoS Tagging of Swedish. ...
www.speech.kth.se/~bea/research.html

 

[PDF] Data-Driven Learning: Theory and Classroom Implementation
11 Data-driven learning: theory and classroom implementation Data-Driven Learning:
Theory and Classroom Implementation TROY BLAPPERT KyungHee University In ...
www.well.com/user/greg/KOTESOL/1997-proceedings/blappert_troy.pdf