How to get a list composed of hyperlinks into an Excel or Word file?

I want to get the Top Ranking lists from the website App Annie, which records ranking information from the iTunes App Store, into a standard Excel file.
Here is the link: http://www.appannie.com/top/
Each app listed under Top Paid, Top Free, and Top Grossing is linked to that app's details page. I essentially want to get these top 100 lists into an Excel file. I do not want to manually type in the day-by-day rankings.
But I have no idea how this could be done.
Also, if anyone knows a source that simply provides an Excel file with the daily App Store rankings, that would be even better.
Thank you.

I had luck pasting from the page as a refreshable web query.
In Excel 2010: Go to Paste > Refreshable Web Query. In the window that pops up, load your desired website, and then click the arrow in the top left in order to select the whole page. Then click import.
The resulting paste isn't exactly pretty, but once you remove the garbage from the top and the bottom you have a fairly workable result.

Related

How to copy all text from website + pop up windows?

There's a web page with loads of contact info that I would like to copy to my clipboard and paste into an excel doc. The problem is that in order to display the info (phone, email, title, company, etc...) you have to click on the contact name and then a little blurb comes up (similar to a business card) that shows all info. Thus, I am unable to select all and copy and would have to do this one by one. There are thousands of contacts, so obviously this would be impractical. The hyperlink does not change when I click the blurb.
I'm wondering if there's a workaround to this - is there a program I can use that can extract the info from all of these small blurb-type pop ups that come from one page?
Any help is appreciated.
There is no systematic way to crawl web pages, as it depends on how the page was developed.
But you can try this:
Click on one such pop-up.
With the "dev tools" of your browser, inspect it and find anything you can use to identify this pop-up (for instance, it may have a certain class or name).
Then look for this class (or name) in the page's source code: all the other pop-ups may be there.
If so, you can do some text parsing to get the info you want.
Finally, you can use something like "search and replace" in a text editor to lay out the result in an Excel-friendly way (for example, as tab-separated columns).
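The steps above can be sketched as follows. This is only an illustration: it assumes each pop-up appears in the page source as a div with the class contact-card, containing span elements with the classes name, phone and email. Those class names, the sample markup and the function names are all made up, so substitute whatever you find with the dev tools. It parses with regular expressions, which is adequate for simple, regular markup but not for arbitrary HTML.

```javascript
// Hypothetical page source: the class names below are assumptions.
const pageSource = `
<div class="contact-card"><span class="name">Ann Lee</span><span class="phone">555-0100</span><span class="email">ann@example.com</span></div>
<div class="contact-card"><span class="name">Bob Ray</span><span class="phone">555-0101</span><span class="email">bob@example.com</span></div>
`;

// Return the text inside the first <span> carrying the given class.
function fieldText(cardHtml, className) {
  const re = new RegExp('<span class="' + className + '">([^<]*)</span>');
  const match = cardHtml.match(re);
  return match ? match[1].trim() : "";
}

// Find every pop-up block and turn it into one tab-separated row.
function cardsToTsv(html, fields) {
  const cardRe = /<div class="contact-card">([\s\S]*?)<\/div>/g;
  const rows = [fields.join("\t")]; // header row
  let card;
  while ((card = cardRe.exec(html)) !== null) {
    rows.push(fields.map((f) => fieldText(card[1], f)).join("\t"));
  }
  return rows.join("\n");
}

const tsv = cardsToTsv(pageSource, ["name", "phone", "email"]);
console.log(tsv);
```

Tab-separated text pasted into Excel lands in separate columns, which replaces the manual search-and-replace step.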

I need to build an offline HTML form for completion, with a button to print the data in a table

I am looking to build an 'easy on the eye' offline form which will be saved locally on users machines.
The purpose of the form is to record information and minutes taken in a meeting.
The form will include a logo, a header, hopefully a dividing strip (preferably blue), and fields to complete in a table:
Date
Time
Location
Attendees
An expanding (downwards) text box which gets bigger when you type enough to fill it.
A print button which gives you the option of saving as a PDF or actually printing it.
I have trawled the internet and even tried to adapt the source code of forms I like the look of, but I'm not confident that, even after I have adapted the code, those forms don't still have some kind of connection to the internet. This matters because the information recorded in the forms will be highly confidential.
Can you point me in the right direction or give me some pointers?
I have seen tutorials but the forms look very basic.
Many Thanks

Using VBScript in tandem with Chrome to fill out forms and download files

I need to download a series of credit reports using Chrome. It has to be Chrome because my company uses LastPass and I don't actually know the password. I also need to use VBScript with the Windows default libraries. The script needs to click (and I counted) 2 links -> then a radio button -> then input into 2 text boxes -> select the 6th item in a dropdown list -> click a download button -> then copy a string from the page and rename the downloaded file based on that. This last part I think I could do with windows.xmlhttp get and then parse it, but if there's a better way I'm looking for suggestions.
My knowledge of VBScript is ad hoc at best. I've had to learn it on the fly on an as-needed basis; frankly, it's a miracle some of the scripts I wrote work at all.
The only solution I could think of was to use WScript.SendKeys "{tab}" and WScript.SendKeys "{enter}" a few dozen times. This works most of the time, but it cannot react to what is on the page, and it can't copy text from the page and rename the file for me. The page also doesn't always have the same number of buttons or links, so my tab count will be off at times. My other idea was to use HTTP POST and HTTP GET, but I'm more or less completely in the dark about how those work.

Click through several links on a webpage and extract information from each?

I have access to a webpage with a list of several hyperlinks, each of which is unique. This is it:
[Image: the webpage I have access to, which lists names from top to bottom. Each name is unique and is a hyperlink.]
I would like to click on these hyperlinks one by one and extract info from the resulting webpage.
Say I click the first name, Adam. It then brings me to the following webpage:
[Image: the user's profile page, which includes info such as name, email status and more]
My goal is to create a program that clicks the name at the top, takes the email address, and puts it in an Excel spreadsheet on my local machine. It then goes back to the original page, clicks the name directly below the one that was previously clicked, and puts that name's email into the spreadsheet as well.
For those looking at the pictures: I would like to click on 'Adam' and put his email into a spreadsheet, then go back, click on 'Adrian' and put his email into the spreadsheet, and so on down the list.
What do I need to do/learn to create a tool that will do this for me?
I know a bit of Java and a tiny bit of HTML. I've been trying to look for a solution, but the most I can get is Excel VBA to click on the first name, and not much more. Even if I got the VBA to click on all the names, it seems I would have to type in an instruction for the VBA to find each unique name, and I don't see much point in doing that since it's probably faster to just do this manually then.
As I'm not a programmer (but would like to be soon :)), I have had some trouble asking this question since I don't know any technical terms.
Any thoughts/advice on how to do this?
With JavaScript and a little PHP you could make this happen. Since it appears this is something running in the browser, it would probably be your best bet.
Make your program click links
JavaScript has the ability to activate links. You could click the first link on the page when the body loads, then keep a counter that goes up each time you click a link.
Find out how many links there are in the document:
var linkCount = document.links.length;
Get the URL of the first link (index 0) in the document:
var firstUrl = document.links[0].href;
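Click the link you want to click (here, a link with the id myLink):
$(function () {
  // Use the element's native click(); jQuery's trigger('click') does not follow a link's href.
  $('#myLink')[0].click();
});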
Now that you're on the next page, you need to grab the email address that is on this page. If you know the CSS selector of the element that holds it, just grab the string inside that element. At this point you can use JavaScript to go back to the previous page.
<button onclick="history.back()">Go Back</button>
Now you are on the original page. Your counter is one number higher. Use the counter to click the next link, and your program will repeat the process, pushing the new email to your array.
var fruits = ["Banana", "Orange", "Apple", "Mango"];
fruits.push("Kiwi");
Once the program runs out of links to click or hits the limit of your counter, you will need to create a text file and write your array to it. You can do this with PHP: make an AJAX call to your PHP file, passing the array, and PHP can then write it to the file.
Here is an example of making a text file and writing to it with javascript
Following this logic, you can fairly easily make an application that activates the links on a page in order, pushes each email address to an array as a string, goes back to the previous page, and continues the process until you have visited all the links. At that point your program writes all the data in the array to a file.
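A rough sketch of that loop, with one change that should be stated plainly: a script running in a page stops when the browser navigates to another page, so instead of clicking each link and going back, this version downloads each linked page and reads the email out of the returned HTML. The a.profile markup, the mailto: link, the URLs and all function names are assumptions for illustration. It also builds CSV text directly rather than calling PHP.

```javascript
// Collect the URL and name of every profile link in the list page.
// Assumption: profile links look like <a class="profile" href="...">Name</a>.
function extractProfileLinks(listHtml) {
  const linkRe = /<a class="profile" href="([^"]+)">([^<]*)<\/a>/g;
  const links = [];
  let m;
  while ((m = linkRe.exec(listHtml)) !== null) {
    links.push({ url: m[1], name: m[2].trim() });
  }
  return links;
}

// Pull the first email address out of a profile page.
// Assumption: the address appears in a mailto: link.
function extractEmail(profileHtml) {
  const m = profileHtml.match(/href="mailto:([^"]+)"/);
  return m ? m[1] : "";
}

// Quote a value for CSV: wrap it in quotes and double any embedded quotes.
function csvCell(value) {
  return '"' + String(value).replace(/"/g, '""') + '"';
}

// Visit each link in order; the loop index plays the role of the counter.
// fetchText is any function that takes a URL and resolves to that page's HTML.
async function collectEmails(listUrl, fetchText) {
  const links = extractProfileLinks(await fetchText(listUrl));
  const rows = [["name", "email"]];
  for (let i = 0; i < links.length; i++) {
    const profileHtml = await fetchText(links[i].url);
    rows.push([links[i].name, extractEmail(profileHtml)]);
  }
  return rows.map((row) => row.map(csvCell).join(",")).join("\n");
}

// Demo with canned pages instead of a real site (all URLs are made up).
const fakePages = {
  "/list": '<a class="profile" href="/adam">Adam</a><a class="profile" href="/adrian">Adrian</a>',
  "/adam": '<a href="mailto:adam@example.com">Email</a>',
  "/adrian": '<a href="mailto:adrian@example.com">Email</a>',
};
const fakeFetch = async (url) => fakePages[url];

const csvPromise = collectEmails("/list", fakeFetch);
csvPromise.then((csv) => console.log(csv));
```

In a browser, on the same site as the list, fetchText could be (url) => fetch(url).then((r) => r.text()). Excel opens the resulting CSV directly.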

Mimicking Spreadsheet Style in a MS-Access Report

I've been tasked with creating a report in MS-Access that looks exactly like a spreadsheet that a vendor supplies to us for my company to fill in.
The number of records per page is about 40 and there are usually 3-6 pages that need to be prepared. Each month there is a new report sent out and I just got finished writing it all in manually while looking at a report I generated. The purpose of this is to avoid manually transcribing the data.
They are adamant about using their format and will not accept a different report, so I'm trying to be sneaky about it.
Problems
I can duplicate the header of the spreadsheet and the rows just fine, I've just run into a few snags.
Blank rows need to be displayed on the last page of the report instead of nothing being printed (whitespace) and then the page footer.
Whitespace that exists between the Details and the Page Footer is present. The page footer should instead appear to be another row of cells, except that it has the text Page Total and the page total on that row.
The second item happens because the Page Footer always appears at the bottom of the page in a set location as opposed to where the records ended (even if they took up the entire page).
Ideas
If there is some way I could create a group based on page, then I could stick that right after the details section so that it would line up nicely as opposed to the page total and still be able to display the page total.
Inserting blank rows into the rows to match the number of records, is this possible? I could calculate how many extra rows I would need to complete the page, but how would I insert those rows into the data source?
Creating a new Excel spreadsheet from a template and just writing the rows there.
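For the second idea, the number of blank rows is simple arithmetic. A sketch of the calculation, shown in JavaScript rather than VBA and assuming a fixed 40 rows per page (the function name is made up):

```javascript
// Number of blank rows needed so that the last page is completely filled.
function blankRowsNeeded(recordCount, rowsPerPage) {
  const remainder = recordCount % rowsPerPage;
  return remainder === 0 ? 0 : rowsPerPage - remainder;
}

console.log(blankRowsNeeded(130, 40)); // 3 full pages plus 10 rows, so 30 blanks
console.log(blankRowsNeeded(120, 40)); // pages already full, so 0 blanks
```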
I'm using MS-Access 2007 here with a MS-Access 2003 MDB.
Any help is greatly appreciated.
If you need gridlines to print at the end of an Access report, one option is to create a background bitmap that you insert into the report's picture property.
This would be rather fussy, as you could use it only if your headers and footers are identical on all pages, and you'd have to be sure that controls entirely cover the whole detail area so that the background graphic will not show through except on pages where there is blank space. Also, if you altered the width of your detail fields, you'd need to edit the graphic to harmonize with those changes.
Let me just say that I consider the insistence on replicating the look of the spreadsheet to be incredibly boneheaded. What purpose is served by these gridlines except to replicate the visual appearance of a spreadsheet? Are they going to use the grid to write things in? If not, then it's just a really idiotic requirement.
Start by turning a copy of their Excel report into a template file. Remove the data, but keep headers, formatting, and formulas as needed. (Some data manipulation will be easier in Access.)
This way you can enter and store the data in Access instead of having users fill in the spreadsheet by hand, and then fill in the spreadsheet in Excel with VBA, based on the template file.
You'll run into different issues of how to place the results of a query to a worksheet and filling in formulas in specific fields, etc., but those can be later questions to post.