dogelinguistics

The doge meme teaches us so much about language learning and how challenging it can be to accurately combine words and patterns when using another language. The FLAX language system teaches us so much about how we can avoid using dodgy language by employing powerful open-source language analysis tools and authentic language resources.

flaxHeader_leftlinkedup trophyThe FLAX (Flexible Language Acquisition) project has won the LinkedUp Vici Competition for tools and demos that use open or linked data for educational purposes. This post is the one I wrote to accompany our project submission to the LinkedUp challenge.

FLAX is an open-source software system designed to automate the production and delivery of interactive digital language collections. Exercise material comes from digital libraries (language corpora, web data, open access publications, open educational resources) for a virtually endless supply of authentic language learning in context. With simple interface designs, FLAX has been designed so that non-expert users — language teachers, language learners, subject specialists, instructional design and e-learning support teams — can build their own language collections.

The FLAX software can be freely downloaded to build language collections with any text-based content and supporting audio-visual material, for both online and classroom use. FLAX uses the Greenstone suite of open-source multilingual software for building and distributing digital library collections, which can be published on the Internet or on CD-ROM. Issued under the terms of the GNU General Public License, Greenstone is produced by the New Zealand Digital Library Project at the University of Waikato, and developed and distributed in cooperation with UNESCO and the Human Info NGO.

REMIX WITH FLAX

images_entries_entry_image_file_-_entry_id-4433_-_20111221124909164.w_420.h_280.m_crop.a_center.v_topAt FLAX we understand that content and data vary in terms of licensing restrictions, depending on the publishing strategies adopted by institutions for the usage of their content and data. FLAX has, therefore, been designed to offer a flexible open-source suite of linguistic support options for enhancing such content and data across both open and closed platforms.

Featuring the Latest in Artificial Intelligence &

Natural Language Processing Software Designs

Within the FLAX bag of tricks, we have the open-source Wikipedia Miner Toolkit, which links in related words, topics and definitions from Wikipedia and Wiktionary as can be seen below in the Learning Collocations collection  (click on the image to expand and visit the toolkit in action).

wikiminer
Wikipedia Mining Tool in FLAX Learning Collocations Collection – click on the image to expand and visit the collection

Featuring Open Data

Available on the FLAX website are completed collections and on-going collections development with registered users. Current research and development with the FLAX Law Collections is based entirely on open resources selected by language teachers and legal English researchers as shown in the table below. These collections demonstrate how users can build collections in FLAX according to their interests and needs.

Law Collections in FLAX

flaxheadercropped

Type of Resource

Number and Source of Collection Resources

Open Access Law research articles
40 Articles (DOAJ – Directory of Open Access Journals, with Creative Commons licenses for the development of derivatives)
MOOC lecture transcripts and videos (streamed via YouTube and Vimeo)
4 MOOC Collections: English Common Law (University of London with Coursera), Age of Globalization (Texas at Austin with edX), Copyright Law (Harvard with edX), Environmental Politics and Law (OpenYale)
Podcast audio files and transcripts (OpenSpires)
15 Lectures (Oxford Law Faculty, Centre for Socio-Legal Studies and Department of Continuing Education)
PhD Law thesis writing
50-70 EThoS Theses (sections: abstracts, introductions, conclusions) at the British Library (Open Access but not licensed as Creative Commons – permission for reuse granted by participating Higher Education Institutions)
British Law Reports Corpus (BLaRC)
8.8 million-word corpus derived from free legal sources at the British and Irish Legal Information Institute (BAILII) aggregation website
FLAX Wikipedia English
Linking in a reformatted version of Wikipedia (English version), providing key terms and concepts as a powerful gloss resource for the Law Collections.
FLAX Learning Collocations
Linking in lexico-grammatical phrases from the British National Corpus (BNC) of 100 million words, the British Academic Written English corpus (BAWE) of 2500 pieces of assessed university student writing from across the disciplines, and the re-formatted Wikipedia corpus in English.
FLAX Web Phrases
Linking in a reformatted Google n-gram corpus (English version) containing 380 million five-word sequences drawn from a vocabulary of 145,000 words.

FLAX Training Videos

Featuring Game-based Activities

Click on the image below to explore the different activities that can be applied to language collections in FLAX.
flaxactivitiesrevised1flaxactivitiesrevised2

FLAX Apps for AndroidAbout FLAX

We also have a suite of free game-based FLAX apps for Android devices. Now you can interact with the types of activities listed above while you’re learning on the move. Click on the FLAX app icon to the right to access and download the apps and enjoy!

 collocsmatchingapp  collocmatchingapp

FLAX Research & Development

oerresearchhubTo date, we have distributed the English Common Law and the Age of Globalization MOOC collections in FLAX to thousands of registered learners in over a 100 countries – wow!

A collaborative investigation is underway with FLAX and the Open Educational Resources Research Hub (OERRH), whereby a cluster of revised OER research hypotheses are currently being employed to evaluate the impact of developing and using open language collections in FLAX with informal MOOC learners as well as formal English language and translation students.

Radio Ga Ga by Queen via YouTube
Radio Ga Ga by Queen via YouTube

This is the first satellite post from the mothership post, Radio Ga Ga: corpus-based resources, you’ve yet to have your finest hour. I have also made the complete hyperlinked post (in five sections) available as a .pdf on Slideshare.

Radio 1

Original, in-house and live, this station brings us what’s new in the world of OER for corpus-based language resources.

Flipped conferencing

Kicking things off in late March with Clare Carr from Durham, we co-presented an OER for EAP corpus-based teacher and learner training cascade project at the Eurocall CMC & Teacher Education Annual Workshop in Bologna, Italy. This was very much a flipped conference whereby draft presentation papers were sent to be read in advance by participants and where the focus was on discussion rather than presentation at the physical event. Russell Stannard of Teacher Training Videos (TTV) was the keynote speaker at this conference and I have been developing some training resources for the FLAX open-source corpus collections which will be ready to go live on TTV soon. New collections in FLAX have opened up the BAWE corpus and have linked this to the BNC, a Google-derived n-gram corpus as well as Wikimedia resources, namely Wikipedia and Wiktionary. These collections in FLAX show what’s cutting edge in the developer world of open corpus-based resources for language learning and teaching.

Focusing on linked resources: which academic vocabulary list?

In a later post, I will be looking at Mark Davies’ new work with Academic Vocabulary Lists based on a 110 million-word academic sub corpus in the Corpus of Contemporary American (COCA) English – moving away from the Academic Word List (AWL) by Coxhead (2000) based on a 3.5 million-word corpus – and his innovative web tools and collections based on the COCA. Once again, Davies’ Word and Phrase project website at Brigham Young University contains a bundle of powerfully linked resources, including a collocational thesaurus which links to other leading research resources such as the on-going lexical database project at Princeton, WordNet.

The open approach to developing non-commercial learning and teaching corpus-based resources in FLAX also shows the commitment to OER at OUCS (including the Oxford Text Archive), where the BAWE and the BNC research corpora are both managed. Click on the image below to visit the BAWE collections in FLAX.

BAWE case study text from the Life Sciences collection in FLAX with Wikipedia resources

Open eBooks for language learning and teaching

Learning Through Sharing: Open Resources, Open Practices, Open Communication, was the theme of the EuroCALL conference and to follow things up the organisers have released a call for OER in languages for the creation of an open eBook on the same theme. The book will be “a collection of case studies providing practical suggestions for the incorporation of Open Educational Resources (OER) and Practices (OEP), and Open Communication principles to the language classroom and to the initial and continuing development of language teachers.” This open-access e-Book, aimed at practitioners in secondary and tertiary education, will be freely available for download. If you’re interested in submitting a proposal to contribute to this electronic volume, please send in a case study proposal (maximum 500 words) by 15 October 2012 to the co-editors of the publication, Ana Beaven (University of Bologna, Italy), Anna Comas-Quinn (Open University, UK) and Barbara Sawhill (Oberlin College, USA).

MOOC on Open Translation tools and practices

Another learning event which I’ve just picked up from EuroCALL is a pilot Massive Open Online Course in open translation practices being run from the British Open University from 15th October to 7 December 2012 (8 weeks), with the accompanying course website opening on Oct 10th 2012. Visit the “Get involved” tab on the following site: http://www.ot12.org/. “Open translation practices rely on crowd sourcing, and are used for translating open resources such as TED talks and Wikipedia articles, and also in global blogging and citizen media projects such as Global Voices. There are many tools to support Open Translation practices, from Google translation tools to online dictionaries like Wordreference, or translation workflow tools like Transifex.” Some of these tools and practices will be explored in the OT12 MOOC.

Bringing open corpus-based projects to the Open Education community

On the back of the Cambridge 2012 conference: Innovation and Impact – Openly Collaborating to Enhance Education held in April, I’ve been working on another eBook chapter on open corpus-based resources which will be launched very soon at the Open Education conference in Vancouver. The Cambridge 2012 event was jointly hosted in Cambridge, England by the Open Course Ware Consortium (OCWC) and SCORE. Presenting with Terri Edwards from Durham, we covered EAP student and teacher perceptions of training with open corpus-based resources from three projects: FLAX, the Lextutor and AntConc. These three projects vary in terms of openness and the type of resources they are offering. In future posts I will be looking at their work and the communities that form around their resources in more depth. The following video from the conference has captured our presentation and the ensuing discussion at this event to a non-specialist audience who are curious to know how open corpus-based resources can help with the open education vision. Embedding these tools and resources into online and distance education to support the growing number of learners worldwide who wish to access higher education, where the OER and most published research are in English, opens a whole new world of possibilities for open corpus-based resources and EAP practitioners working in this area.

A further video from a panel discussion which I contributed to – an OER kaleidoscope for languages – looks at three further open language resources projects that are currently underway and building momentum here in the UK: OpenLives, LORO, the CommunityCafe. Reference to other established OER projects for languages and the humanities including LanguageBox and the HumBox are also made in this talk.

A world declaration for OER

The World OER congress in June at the UNESCO headquarters in Paris marked ten years since the coining of the term OER in 2002 along with the formal adoption of an OER declaration (click on the image to see the declaration). I’ve included the following quotation from the OER declaration to provide a backdrop to this growing open education movement as it applies to language teaching and learning, highlighting that attribution for original work is commonplace with creative commons licensing.

Emphasizing that the term Open Educational Resources (OER) was coined at UNESCO’s 2002 Forum on OpenCourseWare and designates “teaching, learning and research materials in any medium, digital or otherwise, that reside in the public domain or have been released under an open license that permits no-cost access, use, adaptation and redistribution by others with no or limited restrictions. Open licensing is built within the existing framework of intellectual property rights as defined by relevant international conventions and respects the authorship of the work”.

Wikimedia – why not?

Wikimedia Foundation
Wikimedia Foundation

Earlier in September, I volunteered to present at the EduWiki conference in Leicester which was hosted by the Wikimedia UK chapter. Most people are familiar with Wikipedia which is the sixth most visited website in the world. It is but one of many sister projects managed by the Wikimedia Foundation, however, along with others such as Wikiversity, Wiktionary etc.

I will also be blogging soon about widely held misconceptions for uses of Wikipedia in EAP and EFL / ESL while exploring its potentials in writing instruction with reference to some very exciting education projects using Wikipedia around the world. The types of texts that make up Wikipedia alongside many academics’ realisations that they need to be reaching wider audiences with their work through more accessible modes of writing transmission are all issues I will be commenting on in this blog in the very near future.

Presenting the work the FLAX team have done with text mining, incorporating David Milne’s Wikipedia mining tool, the potential of Wikipedia as an open corpus resource in language learning and teaching is evident. I was demonstrating how this Wikipedia corpus has been linked to other research corpora in FLAX, namely the BNC and the BAWE, for the development of corpus-based OER for EFL / ESL and EAP. And, let’s not forget that it’s all for free!

The open approach to corpus resources development

There is no reason why the open approach taken by FLAX cannot be extended to build open corpus-based collections for learning and teaching other modern languages, linking different language versions of Wikipedia to relevant research corpora and resources in the target language. In particular, functionality in the FLAX collections that enable you to compare how language is used differently across a range of corpora, which are further supported by additional resources such as Wiktionary and Roget’s Thesaurus, make for a very powerful language resource. Crowd-sourcing corpus resources through open research and education practices and through the development of open infrastructure for managing and making these resources available is not as far off in the future as we might think. The Common Language Resources and Technology Infrastructure (CLARIN) mission in Europe is a leading success story in the direction currently being taken with corpus-based resources (read more about the recent workshop for CLARIN-D held in Leipzig, Germany).

Reference:

Coxhead, A. (2000). The Academic Word List.

 

radiogaga1
Radio Ga Ga album cover by Queen via Wikipedia

These past few months I’ve been tuning into a lot of different practitioner events and discussions across a range of educational communities which I feel are of relevance to English language education where uses for corpus-based resources are concerned. There’s something very distinct about the way these different communities are coming together and in the way they are sharing their ideas and outputs. In this post, I will liken their behaviour to different types of radio station broadcast, highlighting differences in communication style and the types of audience (and audience participation) they tend to attract.

I’ve also been re-setting my residential as well as my work stations. No longer at Durham University’s English Language Centre, I’m now London-based and have just set off on a whirlwind adventure for further open educational resources (OER) development and dissemination work with collaborators and stakeholders in a variety of locations around the world. TOETOE is going international and is now being hosted by Oxford University Computing Services (OUCS) in conjunction with the Higher Education Academy (HEA) and the Joint Information Systems Committee (JISC) as part of the UK government-funded OER International programme.

I will also be spreading the word about the newly formed Open Education Special Interest Group (OESIG), the Flexible Language Acquisition (FLAX) open corpus-based language resources project at the University of Waikato, and select research corpora, including the British National Corpus (BNC) and the British Academic Written English (BAWE) corpus, both managed by OUCS, which have been prised open by FLAX and TOETOE for uses in English as a Foreign Language (EFL) – also referred to as English as a Second Language (ESL) in North America – and English for Academic Purposes (EAP). Stay tuned to this blog in the coming months for more insights into open corpus-based English language resources and their uses in different teaching and learning contexts.

This post is what those in the blogging business refer to as a ‘cornerstone’ post as it includes many insights into the past few months of my teaching fellowship in OER with the Support Centre in Open Educational Resources (SCORE) at the Open University in the UK. Many posts within one as it were. This post also provides a road map for taking my project work forward while identifying shorter blogging themes for posts that will follow this one. This particular post will also act as the mother-ship TOETOE post from which subsequent satellite posts will be linked.  Please use the red menu hyperlinks in the section below to dip in and out of the four main sections of this blog post series. I have elected to choose this more reflective style of writing through blogging so that my growing understandings in this area are more accessible to unanticipated readers who may stumble upon this blog and hopefully make comments to help me refine my work. Two more formal case studies on my TOETOE project to date will be coming out soon via the HEA and the JISC.

I have also made this hyperlinked post (in five sections) available as a .pdf on Slideshare.

Which station(s) are you listening to?

BBC Radio has been going since 1927. With audiences in the UK, four stations in particular are firm favourites: youth oriented BBC Radio 1 featuring new and contemporary music; BBC Radio 2 with middle of the road music for the more mature audience; high culture and arts oriented BBC Radio 3, and; news and current affairs oriented BBC Radio 4. Of course there are many more stations but these four are very typical of those found around the world. What is more, I’ve selected these four very distinct stations as the basis to build a metaphor around the way four very distinct educational practitioner communities are intersecting with corpus-based language teaching resources. This metaphor will draw on thought waves from the following:

Radio 1 – what’s new and hip in open corpus-based resources and practices

Radio 2 – the greatest hits in ELT materials development and publishing

Radio 3 – research from teaching and language corpora

Radio 4 – The current talk in EAP: open platforms for defining practice

“I mean somebody with the wit and the guts to go and do and create. And, that I believe is what education is all about” – Gordon Pask, 1974

This post is dedicated to the memory of Gary Boyd who taught me about and inspired me with systems thinking and cybernetics for educational practice. He and Gordon Pask, the guy in the Youtube clip above who on another clip someone commented that he’d make a great Dr. Who character, began the Educational Technology programme at Concordia University in Montreal, the oldest EdTech programme in North America where I am now a student.

IATEFL Conference 2012, Glasgow

This week I’ve been to some very encouraging talks and presentations at this year’s IATEFL conference in Glasgow. A key theme that I’ve found running through all of the sessions I’ve attended thus far is that of experimentation; in learning design, in research, in educational leadership, and but not at all least, in teaching.

Adrian Underhill kicked things off with his opening keynote, Mess and Progress, based on systems thinking for leadership, emphasizing the need for post-heroic leadership and flat hierarchies in the many educational contexts around the world. This notion of experimental leadership at the policy, institutional and individual level was effectively carried forward in the following presentation on Tuesday, ELT in Action, by the international A.S.Hornby Educational Trust Scholars, including speakers from Ethiopia, Nigeria, Sudan, Sri Lanka, India, Venezuela, Mexico, Uzbekistan, Bangladesh, Indonesia, and Yemen. [Find out more about the Hornby scholars in an interview with three participating scholars at this IATEFL conference].

Systems thinking is very closely connected to cybernetics and Gordon Pask in the televised broadcast above was a leading cyberneticist and educational technologist experimenting with processes in education, arriving at his famous conversation theory which the second IATEFL keynote speaker, Diana Laurillard, draws upon in her renowned book, Rethinking University Teaching – A Conversational Framework for the Effective Use of Learning Technologies.

Laurillard’s new book, Teaching as a Design Science – Building Pedagogical Patterns for Learning and Technology sold out at the conference. The book along with the tool she introduced in her talk, the Learning Design Support Environment (LDSE), which you can download for free from the LDSE project website, encourages sharing and collaboration between educators to lead, experiment and innovate with learning technologies and to build collective knowledge in this important area. In order to succeed, this collaborative effort will require openness. On Diana Laurillard’s slides she mentions OER – Open Educational Resources – in several places, but perhaps she needed to gloss OER and open practice more overtly to the 2000+ audience at the IATEFL conference because only one person who came to my session on open corpora and OER for ELT had ever heard of the term before. This is despite the popularity of Russell Stannard’s work with free Teacher Training Videos (TTV) which are OER for teacher and learner training with technology in ELT.

 

Flat Hierarchy

Flat hierarchy is the model followed by and experimented with by open education practitioners who choose to embody openness in their everyday practices for reducing barriers to and increasing access to education for all. Openness promotes “communal management by distributed stakeholders (users/producers/contributors) rather than a centralized authority (owners, experts, boards of directors, etc.)” (definition via Wikipedia on Openness). The term OER was coined ten years ago by UNESCO and this year OER stakeholders will convene virtually and in person in June 2012 to establish further international goals for the open education movement. Watch this space.

Returning to Adrian Underhill’s Mess and Progress, he ended his talk on a high note by performing his song, The Reflective Practice Blues, singing the need “…to reflect and not to neglect to try out something different everyday…” Basically, we can’t afford to continue to work in silos as it’s far too costly, both in the sense of wasting resources through the duplication of effort in creating similar copyrighted resources, and also in the sense of wasting potential opportunities for what could be if we would only open up to sharing what we do and the outputs of what we create so we can experiment with and improve upon these things. Underhill captured this ethos in his keynote with the following awareness-raising activity:

“Talk to the people sitting next to you to identify if you work somewhere where…

  1. it’s easy to get people to listen to an experiment with new ideas and suggestions

  2. when one person learns something new, everyone hears about it

  3. making mistakes is part of learning, you can be open about it and it’s not career limiting

  4. staff members of all ranks give each other plenty of quality feedback from above, below, and sideways

  5. everyone is involved in discussing school policies before adoption

  6. one department knows what people in another department are thinking and they help each other” (Underhill, A. Opening keynote address, IATEFL Conference 2012, Glasgow)

Is this reminiscent of your workplace and who you are in your working practice? Would you consider yourself to be an Open Educational Resource? If not yet or not completely, how can you become an experimenter in your practice to help open things up for yourself and others?

Princess Mary, Girl Guides, 1922 via Wikimedia Commons

Hey, I’m not even British but as part of Open Education Week – March 5-11 – I’ve just signed a pledge with the new UK-based Open Education SIG, an international special interest group with a UK flavour (not flavor:).

I attended a meeting held at the Open University in the UK at the end of February to discuss the future of open education in the UK. I am a teaching fellow with the Support Centre for Open Resources in Education (SCORE), one of about 400 people working in UK higher education who have been involved in government-funded open educational resources (OER) projects over the last three years. When we all made our applications for funding to the Joint Information Systems Committee (JISC) and the Higher Education Academy (HEA) in the UK we also made the usual commitment in our proposals to sustaining our OER projects after their funded lifetimes. So, what better way to reinforce this commitment than by signing a renewed pledge to Open Education? While the Cape Town Open Education Declaration has been picked up by many organisations around the world we thought it would be a good idea to re-mix this declaration to make it more personalised for the educational practitioner.

What does this all mean for English language teaching practitioners?

Frontrunners for technology-enhanced ELT, Russell Stannard and David Deubelbeiss, have also been pushing for more open educational resources and practices within ELT.

Recently, I posted a comment on Scott Thornbury’s A-Z of ELT blog regarding the issues of attribution, re-use and the making of derivative resources for teaching English based on original resources created by another author:

One of the things that interests me most about this post and the comments related to it is the issue of attribution to the original work on automaticity by Gatbonton and Segalowitz. Attribution is essential whether you’re sharing resources in closed teaching and learning environments (e.g. classrooms, password-protected virtual learning environments, workshop and continuing professional development spaces) or through publishing channels using copyright or copyleft licences (e.g. books, research articles, blogs, online forum discussions). There is obviously a great amount of sharing and attribution going on in this discussion and the blogging platform is an enabler for this activity.

What also interests me is the behaviour around resource enhancement. As Scott outlines in the example here, an original resource from a research article by Gatbonton and Segalowitz was re-formatted into a workshop by Stephen Gaies (presumably with attribution to Gatbonton and Segalowitz). This in turn inspired Scott to engage in further resource gathering to inform his teaching practice while applying the five criteria for automaticity, and this further informed the section on fluency in his book, How to Teach Grammar (presumably with attribution to Gaies but now he realises he should’ve included attribution to Gatbonton and Segalowitz). In its latest iteration we find the same criteria for automaticity here in his blog post containing more ideas on how to apply this approach in language learning and teaching from both Scott and his blogpost readers. This is a great example of resource enhancement via re-use and re-mixing, something which the creative commons suite of licences http://creativecommons.org/ allow materials developers and users to do while maintaining full legal attribution rights for the original developer as well as extended rights to the re-mixer of that resource to create new derivative resources.

Legally enabling others to openly re-mix your resources and publish new ones based on them was not possible back in 1988. Arguably, Gatbonton and Segalowitz’s paper with the original criteria on automaticity has stood the test of time because of its enhancement through sharing by Gaies and by the same criteria having been embedded in a further published iteration by Scott in How to Teach Grammar. Times have changed and there is a lot we can now do with digital capabilities for best practice in the use and re-use of resources with attribution still being at the core of the exchange between resource creation and consumption. Except that now with self-publishing and resource sharing platforms, including blogs, it’s a lot easier for all of us to be involved in the resource creation process and to receive attribution for our work in sharing. This coming week, March 5-10, is Open Education Week http://www.openeducationweek.org/ with many great resources on how to openly share your teaching and learning resources along with how to locate, re-use, re-mix and re-distribute with attribution those open educational resources created by others. Why not check it out and see how this activity can apply to ELT?

If you’re new to all of this and have any pesky questions about the business models behind open education, please check out Paul Stacey’s blog, Musings on the Edtech Frontier, with his most recent post on the Economics of Open. Information on what the different Creative Commons and Public Domain licences can be found at CreativeCommons.org.

publicdomain
Public Domain licence via Flickr
creativecommons
Creative Commons licence via Flickr
Attribution, Creative Commons licence
Attribution Creative Commons licence via Flickr
noncommercial
Non Commercial Creative Commons Licence via Flickr
sharealike
Share Alike Creative Commons Licence via Flickr
noderivatives
No Derivatives Creative Commons licence via Flickr

 

 

 

 

 

 

 

 

 

 

 

 

So, why the interest in British resources for open English?

I’ve been coming in and out of the UK for the past 10 years with my work related to technology-enhanced ELT and EAP. Resources include not only those artifacts that we teach and learn with but also the vibrant communities that come together to share their understandings with peers through open channels of practice. BALEAP, formerly a British organisation (the British Association for Lecturers in English for Academic Purposes) but now with an outreach mandate to become the global forum for EAP practitioners, is such an informal community of practice. Members within BALEAP are actively making up for a deficit in formal EAP training by providing useful resources to both EAP teachers and learners via their website and through lively discussions relevant to current issues in EAP via their mailing list.

Because of my interest in corpus linguistics and data-driven language learning, I’ve also been working with exciting practitioners from the world of computer science, namely those working at the open source digital library software lab, Greenstone, at the University of Waikato in New Zealand, to help with the testing and promotion of their open English language project, FLAX (the Flexible Language Acquisition project). The FLAX team are building open corpora and open tools for text analysis using a combination of both open and proprietary content. A copyrighted reference corpus such as the British National Corpus (BNC) is enhanced within the FLAX project by being linked to different open reference corpora such as a Wikipedia and a Web-derived corpus (released by Google) as well as specialist corpora, including the copyrighted British Academic Written English (BAWE) corpus, developed by Nesi, Gardner, Thompson and Wickens between 2004-2007 and housed within the Oxford Text Archive (OTA).

Oxford University Computing Services (OUCS) manage the OTA along with jointly managing the BNC which is physically housed at the British Library. The OpenSpires project is also based at the OUCS and this is where Oxford podcasts have been made openly available through creative commons licences for use and re-use in learning and teaching beyond the brick-n-mortar that is Oxford’s UK campus. Try out the Credit Crunch and Global Recession OER that are based on an Oxford seminar series and have been enhanced with corpus-based text analysis resources. Or, make your own resources based on these same seminars to share with your own learning and teaching communities. In addition to being housed on the OUCS website these resources, along with many other creative commons-licensed resources from educational institutions around the world, can also be found on the Apple channel, iTunesU.

So, it seems there’s quite a bit going on with open English in the UK that’s worth engaging with, and maybe even making a commitment to sharing with open educational resources and practices.

A finale take-away

Check out FLAX’s new Learning Collocations collection where you can compare collocations for keyword searches and harvest useful phrases to embed into your writing, using the BAWE and the BNC along with corpora derived from Wikipedia and the Web. There are three training videos on how to use the Learning Collocations collection in FLAX available in the Training Videos section of this blog.