ghoti

blog

  • archive
  • about
  • Understanding translation through corpus

    Do we really need to know grammar to understand language? What if we can have billions and billions of words of data to work with and put it through a computer? Well, that is pretty much what corpus linguistics is. Here is a nice article about how Google Translate works. It may not be full…

  • What is meta-text information … and how to get rid of it?

    Most of us use word processing software like Microsoft Word, Apple Pages or OpenOffice Writer to create doucments these days. Although we see the words on the page we seldom realise how much more information is included to make the words look like they do: the type of font; size; decorations; etc. This is what…

  • Online Word Document cleaner to plain-text

    Here is a nice quick simple Word “cleaner” script with source code by Jonathan Hedley. Straightforward and does what is necessary. But troublesome if one has hundreds or even thousands of documents to convert. I am still looking for a solution here.

  • Language death and the preservation of Australian indigenous cultures

    Before British colonialisation began there in 1788, around 250 aboriginal languages were spoken in Australia by an estimated one million people. Only a few dozen languages remain and the communities number around 470,000 people in a nation of 22 million. It is often said that language is culture. So 250 languages spoken means the existence…

  • A Very Short Introduction to Corpus Linguistics

    I have a made a short online intro for corpling here. I hope you find it useful.

  • A simple guide to using Antconc

    I have created a file about using Antconc, a concordancing program by Laurence Anthony at Waseda University. You can find it here. The latest version was created on 5 March 2011.

  • Making a text file manually

    To make a text file open a text editor (e.g., Notepad in Windows or TextEdit in Mac). Manually type or paste some text into the main window and save. Saving from a word processor like Microsoft Word may not give a “clean” output, that is, some of the letters or punctuation may not render as…

  • Japan, China, South Korea mull academic credit system

    From the Daily Yomiuri due to the lack of archiving. The education ministry has decided to draw up a new framework in conjunction with China and South Korea to allow universities in all three countries to integrate methods to evaluate students’ academic achievements and certify academic credits. The Education, Culture, Sports, Science and Technology Ministry…

  • How to save a Word 2007 document as a PDF

    Microsoft Office Word 2007 now allows you to save a document as PDF. All you have to do is choose to save it as a PDF in File Type in the Save or Save as dialogue box. If you don’t see it in the File Type dropdown list you may have to get the update…

  • Statistical terms – measurement

    Generally, there are four data types in statistics: nominal, ordinal, interval and ratio. Nominal data as the name suggests is characterize data by name. For example, the categorization of someone as male or female is nominal data. There is no order or rank between nominal data or only difference. Ordinal data is data which can…

←Previous Page
1 … 20 21 22 23 24 25
Next Page→

ghoti

Website Powered by WordPress.com.

  • Follow Following
    • ghoti
    • Join 369 other followers
    • Already have a WordPress.com account? Log in now.
    • ghoti
    • Edit Site
    • Follow Following
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar