What is the easiest way to get some data from wikipedia? I would like to get it as CSV file.
Basically the data what I would like to get is just list of names. For example, all the British actors names from this page: http://en.wikipedia.org/wiki/List_of_British_actors_and_actresses
(All from A-Z and names would be enough).
Is this possible? Also this would be done only once so no need for caching or anything like. Just simple get data perform. But I have no clue how to do it really.
PHP, JS, Jquery, JSON would be nice. No java or anything like that!
Have a look at DBPedia and Google Refine. IIRC Google Refine had an example extracting and cleaning data from Wikipedia (see video tutorial). And DBPedia is a database copy of Wikipedia already.
Related
I'm new in html.
how do i convert a .csv file into a table to show in html.
conditions.
the html needs to extract the .csv content and show it on table form on chrome.
the headers are fixed but the data will be updated every week, so html should be able to extract the correct data. only need to update the .csv file.
either you could do it like Ashik said and switch to a database and load everything via php or another programming language, or you try the direct way. Because of how you asked the question I think you don't have any database or knowledge of other programming languages.
Try to use a javascript library, for example jquery and one of its many plugins to achieve this.
Try this for example: https://code.google.com/p/jquerycsvtotable/
I havn't tried it, but it looks promising, maybe it helps you.
I am building a website with ROR 3. I need to provide a page to my clients wherein he could edit his pricing info regarding the application. I am quite confused on how to do this. The pricing page needs to be displayed as an html table with different columns which has got the pricing info.
I am thinking of different ways to do this.
1) Allow the client to create and upload an html page and then save it as a file in my public directory and render as an when the client clicks on the pricing link.
2) The clients may not have bare technical knowledge, hence make the client upload some other formats like Word, Excel etc and then parse it and store it as an HTML file in the public directory.
3) Provide the client with some real time editing tools where in the client could edit in a fixed format, and after wards save the file and render it later.
Also, I wouldn't like to store these infos in my database. There would be quite a few number of clients and hence managing all these data in my database would become cumbersome. Storing all these as plain html files and rendering it later would be the most ideal thing for me.
There might be other better steps in doing this as well. Could you please suggest which might the better, or any other option that could suit my needs? Basically I would want my clients to have a mechanism where they could provide there pricing details, edit it later and display it back as an html table, all this without using an Database backend. Any suggestions would be mostly appreciated.
Good way is Excel(csv format).
You can do PHP with Excel. I thing this is the best solutions for your requirement.
Try this.
http://php.net/manual/en/function.fgetcsv.php
If you are give authority to user to change edit contain and you have to used " CSV or Excel" please see these links:
Importing CSV and Excel
Exporting CSV and Excel
If you really don't want to use database then you can use YAML as a structured storage.
e.g. ( you, most probably, could come up with a better structure )
SMS_Pack:
Sl_No:
1: 10000
2: 25000
3: 50000
You can read those .yml files and parse them as hashes. Should be fairly easy to represent that hash as a HTML table.
For the creation, I'm sure you can come up with some dynamic form input. Or to just let the client send this kind of file ( which might not be the best solution ).
But it just might be easier to manage all of this information within a database.
I want to store file data of a directory in a file. i.e., file name, file size etc so that I can reduce search time. My problem now is to find an efficient way to do it. I have considered json and xml but can't decide between these two. Also if there is a better way let me know.
I'd say that it's up to what kind of data you prefer to work with and to what structure of data you have (very simple as a list of word, less simple as a list of word and the number of time each word was searched,...)
For a list of word you can use a simple text file with one word per line or coma separated (csv), for a less simple structure, json or xml will work fine.
I like to work with json as it's more light than xml and less verbose. If you didn't plan to share this data and/or it isn't complex, you don't need the validation (xsd,...) offered by xml.
And even if you plan to share this data, you can work with json.
You'll need some server side code to write the data to a file, like php, java, python, ruby,...
I would recommend Json file if you use alomst like a properties file.
If you plan to store the data in the file into database then you can go for XML where u have to the option to use JAXB/JPA in java environment
I have a commercial ColdFusion application, running on a MySQL database. A possible new client has approached me, they have been working in a Lotus Notes environment (and their own database) for many years now. Ofcourse they want to migrate their data to my application, before making the move.
I'm trying to get a grip on how to get a thorough feeling of the data, structure and interdependencies in their current database-application. Are there any tools to see a database-structure (like in a RDBMS) of a NSF-file, or is there anyway to dump the structure using ColdFusion etc....I don't have any handson experience with Lotus Notes (I do in the meanwhile have a local Lotus client and their database).
I need a good startingpoint to be able to determine whether or not I can find a way to migrate the data.
Any ideas??
thanks
Bart
To get at the data in Notes, a good option is to use NotesSQL which can be found here:
A quick overview of the Notes data structure is this: Notes is a document-centric database, with non-relational data contained within each document. Notes Databases (NSFs) contain any number of Notes Documents, which in turn contain any number of items that hold data. Each Notes Document can have a different set of items, and thus different data in it. While that sounds like a horrible mess, usually the documents have similar data based on the form used to create the documents.
This all leads to why there is no simple way to get data out of Lotus Notes. There are a few other options, which may or may not be useful depending on how much data you have to migrate.
I personally like using XML to extract data from Lotus Notes. You can do so by creating XML views within a Notes database. IBM has a tutorial that looks helpful.
Using Java or LotusScript, you can write code to extract data from the documents to any format you wish (CSV, XML, TXT, etc)
If it's not a lot of data, you may find getting the data into an Excel format is the simplest intermediary step. Long ago I wrote an add-in tool for exporting data from Lotus Notes to Excel, which may help you. Or you can use the "Edit > Copy Selected To Table" feature in the Lotus Notes client to copy what is visible in a Notes View to the clipboard, and then paste that into Excel. In that scenario, you'd want to edit the views so they show all the data you need.
I hope this helps!
I'm trying to migrate from a custom designed blog software system to a WordPress.com site. I can access my MySQL database of posts and comments without too much difficulty, thanks in part to this post: How do I connect to a MySQL database from Clojure?.
I think my next step is to generate the pseudo-xml file known as a "WordPress eXtended RSS file", and that's where I'm having difficulty. I can navigate my way through the recordsets returned from my database connection (as of now, I'm doing two SQL searches, one for posts, and another for comments, giving me two recordsets), but I don't know what sort of data structure I should put those results into in order to eventually export them into the xml file.
I've been using a lot of file spit/slurps while working my way through this, but I have a feeling that's neither the most efficient way to do it, nor the most "lispy" way.
I think what I need is help defining a clojure-based data structure that I can append both new posts and new comments to, iteratively, for eventual export into the xml file format I need it to finally get to.
Look at clojure.xml/emit and clojure.xml/emit-element; they expect data that look as described in (doc clojure.xml/parse) and walk this data to output XML.