How to download the Wikipedia articles that are listed in the PetScan tool? - mediawiki

I shortlisted a set of Wikipedia articles using the PetScan tool: https://petscan.wmflabs.org/
I used the "Diseases & disorders" category from Wikipedia with a depth value of 2. Approximately 10,000 articles were listed in the results.
My question is: how do I download the articles to my computer? I am new to this, so I need help.

I think I figured it out with the help of the comment from @Tgr above. I navigated to the output options and found several formats for exporting the results:
HTML
JSON
CSV
Pagepile
Choose one of these options to get the results in that format.

Just navigate to the Output tab, choose the format you want to export to, and click Do it. The data will then be downloaded to your PC.
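Note that the exported file holds the list of matching pages, not the article text itself. To fetch the text, the titles from the export can be passed to the MediaWiki API. The following is only a rough sketch under some assumptions: it assumes the CSV export has a column named "title" (check the header of your own file), that the articles live on English Wikipedia, and the User-Agent string is a made-up placeholder. The helper names (read_titles, batches, content_url, download) are hypothetical.

```python
import csv
import io
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: the articles are on English Wikipedia.
API_URL = "https://en.wikipedia.org/w/api.php"


def read_titles(csv_text, column="title"):
    """Return the non-empty values of one column from CSV text.

    The column name "title" is an assumption about the export format.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row[column] for row in reader if row.get(column)]


def batches(items, size=50):
    """Split a list into consecutive chunks of at most `size` items.

    50 is the usual per-request limit on titles for ordinary API users.
    """
    return [items[i:i + size] for i in range(0, len(items), size)]


def content_url(titles):
    """Build an API URL asking for the latest wikitext of each title."""
    params = {
        "action": "query",
        "prop": "revisions",
        "rvprop": "content",
        "rvslots": "main",
        "titles": "|".join(titles),
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


def download(url):
    """Fetch one URL and return the response body as bytes."""
    # Placeholder User-Agent; replace it with one that identifies you.
    request = Request(url, headers={"User-Agent": "petscan-download-sketch/0.1"})
    with urlopen(request) as response:
        return response.read()
```

A possible way to use it: read the exported file into a string, call read_titles on it, and for each chunk returned by batches call download(content_url(chunk)) and write the result to a file. With about 10,000 titles that is roughly 200 requests, so a short pause between requests is polite.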

Related

Way to get detailed listing of all Pages properties in OneNote?

I have the latest OneNote on Windows 10. I'm trying to create a list of all my Pages in all the Notebooks: one line per Page, showing its Name, Section and Notebook plus created date. As it's for import into (say) Excel, it would be nice to have it delimited, say with # characters.
Nice Wines#Entertainment#Home Notebook#2021-10-01T06:57:32+10:00
I really only know Python and PowerShell. Is there any sample code you can suggest or another way to get this? Thanks.
I installed this PowerShell module and then another Stack Overflow user helped me get over another hurdle when using it.

PHPStorm Find and Replace in All Files Except One

Sometimes when I do a find and replace for a string in my project, I'd like it to be applied to all directories / files except one. Is there a way to exclude a particular file in the GUI screen that appears before you commence your search?
This link might help you.
No need to upvote or accept my answer if it works for you :)
Exclude folder from search but not from the project list
Or this, using scopes: How to exclude files/folders from code inspection in PHPStorm?

Trouble Getting a Locally Hosted Copy of the English Language Wiktionary to include the Translations Sections

I used MWDumper - http://www.mediawiki.org/wiki/Mwdumper - to import the XML dump of the English Language Wiktionary (specifically the file named enwiktionary-20120930-pages-meta-current.xml) to my local server.
I have found that under the Translations section (on each page for each English word), next to the name of each language where I should be able to see the definition in a foreign language, I instead see Template:Tø, Template:T+, or Template:T-, and I am not sure why this is.
As an experiment, I also used WikiTaxi - http://www.yunqa.de/delphi/doku.php/products/wikitaxi/index - with the exact same XML dump and did not have this problem when viewing under WikiTaxi.exe.
I have been searching through mediawiki.org looking for the answer, but have so far not been successful.
Okay, I found out that MWDumper did the right thing when importing the XML dump. All the translations are there. I just had to click on the Template:T+, Template:T- and Template:Tø links and add a template according to the instructions at http://www.mediawiki.org/wiki/Templates.

MediaWiki API: How to get an article's list of editors

As part of the WP:ASE project, I want to get the list of editors that have edited a given article.
For instance, for the article Szklarka Mielęcka (history) that would be:
Kotbot, AnomieBOT, Xenobot
I could not find anything in the MediaWiki API.
Any better idea than scraping the history web page?
Downloading the history data dumps is not a solution because I don't have the resources to handle 5 terabytes of text.
Scale: I want to do this on about 1000 random articles, twice a year.
I have found:
http://en.wikipedia.org/w/api.php?action=query&prop=revisions&titles=Sinjhoro&rvprop=user&rvlimit=500
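A rough sketch of how that query could be turned into a list of distinct editors, assuming the default JSON response layout (pages keyed by page ID, each with a "revisions" list). The function names and the User-Agent string are made up for illustration. Articles with more than 500 revisions need the continuation values the API returns, which the loop below passes back unchanged.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_URL = "https://en.wikipedia.org/w/api.php"


def revisions_url(title, cont=None):
    """Build the revisions query for one title, with optional continuation."""
    params = {
        "action": "query",
        "prop": "revisions",
        "titles": title,
        "rvprop": "user",
        "rvlimit": 500,
        "format": "json",
    }
    # Continuation values from a previous response are passed back as-is.
    params.update(cont or {})
    return API_URL + "?" + urlencode(params)


def users_in(response):
    """Collect the distinct user names found in one decoded API response."""
    users = set()
    for page in response.get("query", {}).get("pages", {}).values():
        for revision in page.get("revisions", []):
            # Revisions with a hidden user carry no "user" key; skip them.
            if "user" in revision:
                users.add(revision["user"])
    return users


def editors(title):
    """Return the sorted distinct editors of an article (network access)."""
    users, cont = set(), None
    while True:
        # Placeholder User-Agent; replace it with one that identifies you.
        request = Request(revisions_url(title, cont),
                          headers={"User-Agent": "editor-list-sketch/0.1"})
        with urlopen(request) as response:
            data = json.load(response)
        users |= users_in(data)
        cont = data.get("continue")
        if not cont:
            return sorted(users)
```

At the stated scale (about 1000 articles, twice a year) calling editors(title) once per article should be manageable without downloading any dumps.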

How to search a word in a html file without any java coding?

I'm doing a project in Java which creates a user manual for a piece of software (HTML files that are linked together, like the Windows "Help and Support Centre"). Once the user manual is created, only the HTML files remain. I now want to search for the HTML files that contain a specified keyword (like a search engine). How can I do this without Java code?
Use grep, find, or a Python script, or open any file with a text editor and try Edit -> Search.
(On Windows, use the Windows "search in files" feature.)
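As an illustration of the "Python script" option, here is a minimal sketch using only the standard library. It assumes the manual is a folder of .html files encoded as UTF-8; the names TextExtractor, page_text, contains and search are made up for this example. Unlike plain grep, it ignores tag names and attribute values, so a keyword that only appears in a link target does not count as a hit.

```python
from html.parser import HTMLParser
from pathlib import Path


class TextExtractor(HTMLParser):
    """Collect the text between tags, ignoring tag names and attributes."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        # Note: text inside <script> and <style> elements is collected too.
        self.parts.append(data)


def page_text(html):
    """Return the text content of an HTML string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def contains(html, keyword):
    """Case-insensitive check for the keyword in the page text."""
    return keyword.lower() in page_text(html).lower()


def search(folder, keyword):
    """Return the .html files under `folder` whose text has the keyword."""
    hits = []
    for path in sorted(Path(folder).rglob("*.html")):
        html = path.read_text(encoding="utf-8", errors="ignore")
        if contains(html, keyword):
            hits.append(path)
    return hits
```

For example, search("manual", "printer") would list every page under a folder named manual that mentions "printer"; both the folder name and the keyword are placeholders.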
If all of your other code is written in Java, then it would be sensible (without knowing your use case) to use Java for searching as well. You could of course use command-line programs such as grep or find, or the built-in search functionality of a web browser, but if the search should be part of a Java application anyway, why not go for Java and, e.g., Lucene?
If this 'help' is going to be online, then you can embed Google search in it (limiting the search results to the specified site:). Alternatively, if you're hosting the pages yourself, you can use htdig for indexing the pages.
However, if it's going to be offline, you'll be better off generating a static index page with links to topics. To create a more help-system-like user experience, you can hide the contents of the index in invisible HTML DIV tags and add JavaScript that takes the searched phrase as input and unhides the matched words with their links.
Maybe I'm missing something, but have you looked at JavaHelp? It has indexing and searching built in, and can be used online or offline.