English words for an online dictionary (MySQL)

I'm building an online English-to-Malayalam and Malayalam-to-English dictionary.
Here is the link: http://www.vanmaram.com/
The website offers users the option to add words.
I would like to add some words myself.
If anyone knows where to get a list of English words, could you please point me to it?
It would be very helpful.

There are many open dictionary files available on the internet. I would recommend the ones from OpenOffice or something similar; they also have a Malayalam one.
OpenOffice also refers to a more up-to-date word list from aspell.net.

You can check this out too: MySQL formatted English words list
I don't know how many words are missing from it, but it's a huge list anyway :)
It contains about 128,000 to 129,000 words.
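As a rough sketch of how such a word list could be prepared for MySQL: the snippet below cleans raw list lines and groups them into batches for a parameterised insert. The table name `english_words` and the helper names are hypothetical, and the "word/FLAGS" handling assumes a Hunspell-style .dic file like the ones shipped with OpenOffice. Words containing apostrophes or digits are simply dropped here, which may or may not be what you want.

```python
from typing import Iterable, Iterator

# Hypothetical table: CREATE TABLE english_words (word VARCHAR(64) PRIMARY KEY);
# INSERT IGNORE skips words that are already present.
INSERT_SQL = "INSERT IGNORE INTO english_words (word) VALUES (%s)"


def clean_words(lines: Iterable[str]) -> list[str]:
    """Return sorted, de-duplicated, lower-cased words from raw list lines."""
    words = set()
    for line in lines:
        # Hunspell-style entries look like "word/FLAGS"; keep the part before "/".
        word = line.strip().split("/", 1)[0].lower()
        # Drops blank lines, the leading entry count, and anything non-alphabetic.
        if word.isalpha():
            words.add(word)
    return sorted(words)


def batches(items: list[str], size: int) -> Iterator[list[tuple[str]]]:
    """Yield parameter tuples in chunks suitable for cursor.executemany()."""
    for start in range(0, len(items), size):
        yield [(w,) for w in items[start:start + size]]
```

With a MySQL driver that uses `%s` placeholders, each batch could then be passed to `cursor.executemany(INSERT_SQL, batch)`.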

Related

How to get the name of Wikipedia article in other language

I'm trying to extract English Wikipedia articles related to a list of Arabic articles, let's say I have this article
https://ar.wikipedia.org/wiki/%D8%A7%D9%84%D9%82%D8%AF%D8%B3
and I need to extract the English version of it
https://en.wikipedia.org/wiki/Jerusalem
The problem is that I don't have a list of English titles corresponding to my list of Arabic titles to use with the APIs. I am thinking of extracting the language links and processing the result to get the English titles, but is there an easier way to do this with the MediaWiki API that you can suggest?
The easiest way to do this is through Wikidata. There is a Wikidata item for each Wikipedia page, and the item links to the pages about that term in all the wikis. So, here's an example query that would give you the English name from the Arabic name:
https://www.wikidata.org/w/api.php?action=wbgetentities&sites=arwiki&titles=%D8%A7%D9%84%D9%82%D8%AF%D8%B3&languages=en
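A minimal Python sketch of the same lookup, assuming you only want the English article title: it asks `wbgetentities` for the site links and reads the `enwiki` entry from the JSON response. The function names and the User-Agent string are placeholders of mine, and the `props=sitelinks` and `sitefilter` parameters are used here to trim the response rather than taken from the query above.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_URL = "https://www.wikidata.org/w/api.php"


def build_query_url(title: str, source_site: str = "arwiki",
                    target_site: str = "enwiki") -> str:
    """Build a wbgetentities URL that looks an item up by its page title."""
    params = {
        "action": "wbgetentities",
        "sites": source_site,
        "titles": title,
        "props": "sitelinks",       # only the site links are needed
        "sitefilter": target_site,  # restrict them to the target wiki
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


def extract_title(response: dict, target_site: str = "enwiki") -> Optional[str]:
    """Return the linked page title on target_site, or None if there is none."""
    for entity in response.get("entities", {}).values():
        link = entity.get("sitelinks", {}).get(target_site)
        if link:
            return link["title"]
    return None


def fetch_english_title(title: str) -> Optional[str]:
    """Perform the HTTP request (needs network access)."""
    # Placeholder User-Agent; replace it with one that identifies your script.
    request = Request(build_query_url(title),
                      headers={"User-Agent": "example-script/0.1"})
    with urlopen(request) as resp:
        return extract_title(json.load(resp))
```

Calling `fetch_english_title` with the Arabic title from the question should then yield the English article name, provided the item has an `enwiki` link.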

A good practice to work with dictionaries?

I'm starting to develop a game (AS3), and in one step, the participants have to type a word in one of 5 different available languages, and then that word is translated into the other 4.
For the sake of example:
I choose the word "home" in English, and then these fields are filled:
Spanish: casa
Russian: домой
German: Zuhause
French: maison
So the question is, what would be the best approach to do this? Are there any downloadable dictionaries available for different languages, or would it be better to use a web service?
Also something to consider is that the translations shouldn't consist of more than one word.
I have never worked with dictionaries before, so I'd rather investigate a bit instead of starting off on the wrong foot. Thanks.
You should use property files, one per language. This is the usual approach for building a multi-language application.
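To illustrate the idea, here is a small Python sketch (the game itself is AS3, so treat this as language-neutral pseudocode that happens to run). It assumes one `key=value` file per language sharing the same keys; the file contents, the key name `home`, and both helper functions are made up for the example.

```python
def parse_properties(text: str) -> dict[str, str]:
    """Parse simple key=value lines, ignoring blanks and # comments."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:
            entries[key.strip()] = value.strip()
    return entries


def translate(word: str, source: str,
              bundles: dict[str, dict[str, str]]) -> dict[str, str]:
    """Find the key whose value in the source language equals the word,
    then return that key's value in every other language."""
    for key, value in bundles[source].items():
        if value.lower() == word.lower():
            return {lang: entries[key]
                    for lang, entries in bundles.items()
                    if lang != source and key in entries}
    return {}


# Hypothetical file contents, one per language.
bundles = {
    "en": parse_properties("# English\nhome=home"),
    "es": parse_properties("home=casa"),
    "de": parse_properties("home=Zuhause"),
    "fr": parse_properties("home=maison"),
}
```

Because every entry is a single value per key, this layout also fits the requirement that translations consist of one word only.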

Import Wikipedia - get matching articles in different languages

I have imported the Wikipedia database dumps (pages-articles.xml versions) for two languages, English and Maltese.
However I need to be able to link an article from one language to another. I am under the impression that I have to import additional tables for this. Which tables do I need to import?
Thanks in advance!
Maltese db dump repository: http://dumps.wikimedia.org/mtwiki/20121012/
English db dump repository: http://dumps.wikimedia.org/enwiki/20121001/
That information is in the langlinks table, so you will need to download langlinks.sql.gz, which is a SQL dump, not XML.
One possible issue is that those links may not be symmetric, so for example en:A may link to mt:B, but mt:B links to en:C. You'll have to decide what to do about that.
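A short sketch of how the asymmetric cases could be detected once the links are loaded. It assumes you have already turned the two langlinks tables into plain title-to-title mappings (the table stores a source page ID in `ll_from`, so resolving that to a title via the page table is left out here). The function name and sample titles are hypothetical.

```python
from typing import Optional


def find_asymmetric(en_to_mt: dict[str, str],
                    mt_to_en: dict[str, str]) -> list[tuple[str, str, Optional[str]]]:
    """Return (en_title, mt_title, back_link) for every English article whose
    Maltese counterpart does not link back to it. back_link is None when the
    Maltese article has no English link at all."""
    mismatches = []
    for en_title, mt_title in sorted(en_to_mt.items()):
        back_link = mt_to_en.get(mt_title)
        if back_link != en_title:
            mismatches.append((en_title, mt_title, back_link))
    return mismatches


# Made-up example mirroring the case above: en:A -> mt:B, but mt:B -> en:C.
en_to_mt = {"A": "B", "D": "E"}
mt_to_en = {"B": "C", "E": "D"}
print(find_asymmetric(en_to_mt, mt_to_en))
```

What to do with the mismatches (trust one direction, drop them, or review by hand) remains a decision for your application.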
If I understand correctly, you want to create an English wiki and a Maltese wiki, and to have them link to one another.
To do this, you need to add the interlanguage prefixes for 'en' and 'mt' to the interwiki table. Here's a description of it:
https://www.mediawiki.org/wiki/Interwiki_link
You should probably remove the links to all the other languages from the articles. Otherwise they will show as junk at the bottom of a lot of articles.
P.S.: It's great to hear that you are working with the Maltese language! I really love it :)

Best Way to Implement Natural Language Search on a Site

I implemented a basic search on my site using a LIKE clause in MySQL, but it doesn't help in many cases.
I have a search I am testing with: "swift bird"
The entry in the database is: "Swift"
What do people usually do in order to catch as many of the possibilities, abbreviations, and variations of the words they need to find when implementing their own basic search on their site?
If you want to test, here is the url for this:
http://www.comehike.com/outdoors/birds/search_birds.php
Thanks,
Alex
Have you investigated MySQL's Full Text Search capabilities?
http://dev.mysql.com/doc/refman/5.5/en/fulltext-search.html
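As a hedged sketch of what that could look like for the "swift bird" case: the helper below turns the user's input into a boolean-mode search string with a trailing `*` wildcard on each term, so a row containing only "Swift" can still match. The table `birds`, its columns, and the function name are assumptions; the column needs a FULLTEXT index, and in MySQL 5.5 that means a MyISAM table. Very short words may also be ignored, depending on the server's minimum word length setting.

```python
import re

# Hypothetical schema: birds(id, name) with FULLTEXT(name).
SEARCH_SQL = (
    "SELECT id, name FROM birds "
    "WHERE MATCH(name) AGAINST (%s IN BOOLEAN MODE)"
)


def to_boolean_query(user_input: str) -> str:
    """Convert free text into optional prefix terms for boolean mode.

    Terms without a leading + are optional, so a row matching any one
    of them is returned.
    """
    terms = re.findall(r"[A-Za-z0-9]+", user_input.lower())
    return " ".join(term + "*" for term in terms)


print(to_boolean_query("swift bird"))
```

The resulting string would be passed as the single parameter of `SEARCH_SQL` through your MySQL driver, which keeps the user input out of the SQL text itself.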

OCR and word reviewing

I'm using Tesseract for my letter recognition project, and currently the recognition is quite good. The image processing part was done using the OpenCV libraries.
The letters are handwritten, but there are some problems recognising the letter "O" and the number "0". These characters appear in data fields such as names, which cannot contain any numbers, and in fields such as date of birth, which contain only numbers. So I would like to restrict the recognition system so that each data field accepts only numbers or only letters.
I would also like to check the recognised letters against a list of possible words to improve the accuracy of the data. I would like to use the OpenCV libraries for this task, but I don't know which libraries would help or what functionality they offer. Can someone please help me? Thank you.
Regards,
Thilanka.
I've never used Tesseract. However, in the FAQ it says
How do I recognise only digits?
TessBaseAPI::SetVariable("tessedit_char_whitelist", "0123456789");
Presumably you could use the pattern of the FAQ entry to set it up so it only recognises letters or just digits appropriately.
If you have already tried this, can you give more details of why it doesn't work?
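A rough Python sketch of both ideas from the question, under stated assumptions: it builds a per-field whitelist in the `-c name=value` form that the Tesseract command line accepts for variables, applies a simple O/0 correction after recognition as a fallback, and checks a recognised word against a list of known words using the standard library (not OpenCV). All function names and the sample vocabulary are hypothetical.

```python
import difflib
import string

DIGITS = "0123456789"
LETTERS = string.ascii_letters


def whitelist_config(allowed: str) -> str:
    """Build a config string setting tessedit_char_whitelist for one field."""
    return "-c tessedit_char_whitelist=" + allowed


# Fallback corrections for the O/0 confusion described in the question.
TO_DIGIT = str.maketrans({"O": "0", "o": "0"})
TO_LETTER = str.maketrans({"0": "O"})


def fix_field(text: str, kind: str) -> str:
    """Replace look-alike characters according to the field type."""
    if kind == "digits":
        return text.translate(TO_DIGIT)
    if kind == "letters":
        return text.translate(TO_LETTER)
    return text


def closest_word(word: str, vocabulary: list[str]) -> str:
    """Return the most similar known word, or the input if none is close."""
    matches = difflib.get_close_matches(word, vocabulary, n=1, cutoff=0.8)
    return matches[0] if matches else word
```

The `cutoff` of 0.8 is an arbitrary starting point; for short names it may need tuning to avoid replacing a correct reading with a similar but wrong entry.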