Alternatives to Deprecated Google News API - json

What are some alternatives to the comprehensive and now deprecated Google News API?
I'm trying to load some JSON data from two government webservices to be parsed into a query for a news API which will send me back hefty relevant news.
It will be over economic and regional data. So it could be lowest paying job in dallas county.
Are there any news api with as much functionality?

This is a relatively old question, but there have been lots of recent developments in the News API space and I thought I could shed some light.
I’m going to immediately disregard the more expensive choices like Bing ($7/1,000 requests), since most people won’t find it easy to afford them, and the results are actually often inferior to more specialized solutions.
That means you’re basically left with two options: News API and newsapi.org. I really do enjoy newsapi.org, they do a fantastic job and provide high quality search results. They also provide developers with 500 free requests per day, which is pretty great.
ContextualWeb News API does have a couple of advantages: they offer 10,000 requests per month free, and after that our API is significantly cheaper and more flexible (the baseline is $0.5/1000 requests, compared with a slightly higher $1.8/1000 requests with newsapi.org’s basic plan). ContextualWeb also offers result keywords, allowing you to do all kinds of nifty machine learning stuff with the search results.
Having said that, newsapi.org’s documentation is currently more straightforward, and their results are mostly always spot on.

Check out the Social Animal NEWS API.
The news database has 450 million-plus articles and 1 million articles are added on a daily basis.
Notable features:
It is fast and provides breaking news across the web in 39 languages.
Social engagement data is available for all news URLs which enables surfacing top quality articles
Offers Sentiment Analysis to filter articles based on their sentiments.
For more details, please visit the documentation page.
Disclaimer: I work for Social Animal.

There's a free alternative that provides good data but the quantity of query is limited.
If you want to get a higher number of query per day you must pay a subscription.
The site is called gnews.io, it's really simple to authenticate and to get a key.

You could use a general news api site like newsapi.org or if you want more specific niche news (e.g. stocks) you can search for niche api's like iex, alphavantage or stocknewsapi.com

Aylien provides a News API that gives you access to NLP-enriched news articles from 80,000+ news sources: https://aylien.com/product/news-api/demo

The Newsdata.io API is a simple news API with this you can search over 50,000 news data sources worldwide. You can use this API to get live-breaking news from any country in the world in your preferred language as they provide news data in 22 languages in 7 categories from 88 countries.
Newsdata.io allows you to search for published articles using keywords or phrases, languages, publications, time. You can sort the results by time, the popularity of the publication source, or location.
Newsdata.io also provides news data analysis and it includes data analysis like topic labeling, intent detection, sentiment analysis, emotion analysis, entity extraction, semantic similarities.
The Newsdata.io API is free to use for non-commercial purposes with 500 API requests per day with 10 articles per request, 99.99% SLA uptime.
Data format: The API sends quick GET HTTP requests and returns JSON results.
Ease of use: The API is simple to use. Furthermore, you can use its documentation to get started implementing the API in a matter of minutes.

Maybe you could try Google BigQuery

Related

Is it possible to return place reviews mentioning certain keywords in the text?

I know I can return a JSON array of up to 5 reviews. If I can return just 5 reviews that include only certain keywords that would be fine. Is it possible?
If not, perhaps I could generate a request that returns all the reviews for a place, then perform the keyword filter on the results.
I understand the business owner may return all reviews via My Business API, but I will not have My Business access for the places. I believe I can get more than 5 results with a premium plan, but assume that will get pretty expensive.
You are right the only way to get all reviews is via Google My Business API.The Places API at the moment provides only 5 reviews doesn't matter Premium or Standard plan.
There are a couple of feature requests in Google issue tracker you might be interested in:
https://issuetracker.google.com/issues/35825957
https://issuetracker.google.com/issues/35821903
Note these requests are pretty old, Google didn't set high priority on them. You can star them to express your interest and subscribe to further notifications.
Hope this answer clarifies your doubt.

How to use Wikipedia API to get page statistics for all pages in a Category?

I am looking to identify the most popular pages in a Wikipedia Category (for example, which graph algorithms had the highest page views in the last year?). However, there seems to be little up-to-date information of Wikipedia APIs, especially for obtaining statistics.
For example, the StackOverflow post on How to use Wikipedia API to get the page view statistics of a particular page in Wikipedia? contains answers that no longer seem to work.
I have dug around a bit, but I am unable to find any usable APIs, other than a really nice website, where I could potentially do this manually, by typing page titles one by one (max. up to ten pages only): https://tools.wmflabs.org/pageviews/. Would appreciate any help. Thanks!
You can use a MediaWiki API call like this to get the titles in the category: https://en.wikipedia.org/w/api.php?action=query&list=categorymembers&cmtitle=Category:Physics
Then you can use this to get page view statistics for each page: https://wikimedia.org/api/rest_v1/#!/Pageviews_data/get_metrics_pageviews_per_article_project_access_agent_article_granularity_start_end
(careful of the rate limit)
E.g. for the last year, article "Physics" (part of the Physics category): https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/en.wikipedia.org/all-access/all-agents/Physics/daily/20151104/20161104
If you're dealing with large categories, it may be best to start downloading statistics from https://dumps.wikimedia.org/other/pageviews/2016/2016-11/ to avoid making so many REST API calls.
TreeViews is a tool designed to do exactly this. Getting good data is going to be hard if your category contains thousands of pages, in which case you'd better do the calculations yourself as Krenair suggests.

Foursquare Explore API not providing venue tips on certain accounts

When using the Foursquare Developer API to perform a venue search, I receive different results depending on the account being used.
For a given authorised query (for example, https://api.foursquare.com/v2/venues/explore?ll=40.7,-74) I am given back a collection of items with "reasons" and "venue" objects. On some accounts, it also provides "tips" data; on others it does not.
Is this additional information dependent on certain account settings, or is there something I'm missing?
Thanks in advance!
The explore endpoint is tailored to each user and provides dynamic recommendations depending on social context, time of day, and user interests. Thus, two queries are likely to contain different content depending on who is querying the API.
For example, if a friend has left a tip at a nearby venue, it's more likely that you'll see that in your explore results, because it provides social reinforcement to the recommendation.
For other venues, there might be signals more important than tips, so a tip is not surfaced.
The official foursquare apps provide good examples of this data in action, as the explore result panes change format slightly from recommendation to recommendation.
Hope that helps?

How to create indeed.com like search?

If you have used indeed.com before, you may know that for the keywords you look for, it returns a traditional search results as long as multiple search refinement options on the left side of screen.
For example, searching for keyword "designer", the refinement options are:
Salary Estimate
$40,000+ (45982)
$60,000+ (29795)
$80,000+ (15966)
$100,000+ (6896)
$120,000+ (2828)
Title
Floral Design Specialist (945)
Hair Stylist (817)
GRAPHIC DESIGNER (630)
Hourly Associates/Co-managers (589)
Web designer (584)
more »
Company
Kelly Services (1862)
Unlisted Company (1133)
CyberCoders Engineering (1058)
Michaels Arts & Crafts (947)
ULTA (818)
Elance (767)
Location
New York, NY (2960)
San Francisco, CA (1633)
Chicago, IL (1184)
Houston, TX (1057)
Seattle, WA (1025)
more »
Job Type
Full-time (45687)
Part-time (2196)
Contract (8204)
Internship (720)
Temporary (1093)
How does it gather statistics information so quickly (e.g. the number of job offers in each salary range). Looks like the refinement options are created in realtime since minor keywords load fast too.
Is there a specific SQL technique to create such feature? Or is there a manual on the web explaining the tech behind this?
The technology used in Indeed.com and other search engines is known as inverted indexing which is at the core of how search engines work (e.g Google). The filtering you refer to ("refinement options") are known as facets.
You can use Apache Solr, a full-fledged search server built using Lucene and easily integrable into your application using its RESTful API. Comes out-of-the-box with several features such as faceting, caching, scaling, spell-checking, etc. Is also used by several sites such as Netflix, C-Net, AOL etc. - hence stable, scalable and battle-tested.
If you want to dig deep into facet based filtering works, lookup Bitsets/Bitarrays and is described in this article.
Why do you think that they load "too fast"? They certainly have nice, scaled architecture, they use caching for sure, they might be using some denormalized datastore to accelerate some computations and queries.
Take a look at google and number of web pages worldwide - you also think that google works too fast?
In addition to what Mios said and as Daimon mentioned it does use a denormalized doc store. Here is a link to Indeed's tech talk about its docstore
http://engineering.indeed.com/blog/2013/03/indeedeng-from-1-to-1-billion-video/
Also another related article on their Engineering blog:
http://engineering.indeed.com/blog/2013/10/serving-over-1-billion-documents-per-day-with-docstore-v2/

Using the CloudMade API or some other geosearching API to get tourism spots?

I've been looking for an online geosearch API such that I can get location data for tourism spots, e.g. historic landmarks, tourism information locations, castles, etc. I'm not looking for a map image, but rather a list of location tagged data. I've played with CloudMade, but none of the object types seems to returning anything like the number of reponses I expect (even with a wide bounding box). None of Bing, Yahoo or Google Maps seems to have a geosearch API w/o the maps -- just geocoding. I've seen apps that find restaurants open 24-hours or apartments for rent near me -- where does that day come from? Thanks!
Where does that data come from:
Nearby restaurant searches are often performed with the Google AJAX Search API using its Local Search mode. It doesn't work for performing vague searches like "tourism spot" or "historical landmark", it needs something more specific like "museum". It's fundamentally a business search (they initially bought data from Yellow Pages, so it's heavily biassed towards organisations that have a telephone) so it won't find natural features. It's also limited to about 8 replies per query.
Apps that find apartments for rent tend to have their own database, and that database is built by reading things like Craigslist.
CloudMade uses limitation on the returning results not to hurt your browser. By default it returns 10 result. You could use "limit=" and "offset=" for paging. And the response contains "found" field which gives you the total number of results.
BTW, couple of weeks ago new CloudMade Geocoding API was published. It is far faster and easier to use.
More info - http://blog.cloudmade.com/2009/10/08/new-geocoding-engine-delivers-results-up-to-24-times-faster/
Documentation - http://developers.cloudmade.com/wiki/geocoding-http-api/Documentation