Finding number of pages of a website - html

I want to find the number of pages of a website. Usually what I look for is a sitemap but I just encountered a site which does not have a sitemap so I am out of ideas of how to find its total pages. I tried to Google the URL but that did not helped much. Is there any other way we can find out the pages of a website?
Thanks in advance.

Ask Google "site:yourdomain.com"
This gives you all indexed pages.
Or use the free tool "Xenu". It crawls the whole site. But it won't find sites which have no internal links pointing to them. You can also export a sitemap with it.

I was about to suggest the same thing :) If this is a website you own, you can also add it to the Google Webmaster tools. It will show you lots of things about your site including number of links, pages, search terms, etc Its very useful and is free of charge.

I have found a better solution myself. You can go to Google Advanced Search and restrict the search results to your domain name. Leave everything else empty. It would give you the list of all pages cached by Google.

You could also try A1 Website Analyzer. But for all link checker software, you will have to make sure you configure them correctly to obey/not-obey (whatever your needs are) e.g robots.txt, noindex and nofollow instructions. (Common source for confusion in my experience.)

Related

Html and css website not showing up on search engine

I have a website that I made in html and css. I registered the domain and I am hosting it using googledrive if you go into a browser and type in (websitename).com in the url it works but if you type it into google or bing search engine it dosnt come up. There were about 20 things that came up and none of them were my site. I am using godaddy.com for my domain name. Do I have to enable something? What am I doing wrong?
A few things you should know:
You need to go into the Google Search Console and Bing Webmaster Tools (https://www.google.com/webmasters/tools/home) and add your website and submit a sitemap
Websites aren't crawled and indexed immediately, it takes time. Also, your website may never rank, it depends on how relevant the search engines determine your content is (See this article on SEO http://searchengineland.com/guide/what-is-seo)
Also, you should post this in the Webmasters community, not here.

webpage in public_html don't show up in google search

Is there a way to get my webpage which is stored in my public_html folder on my server to show up in google? Right now I don't get any hit when searching the title.
Sign Up for Google Webmaster and list your site there, here is the link...https://www.google.com/webmasters/
http://moz.com/beginners-guide-to-seo
Basically what you need to do is turn your page into a SEO beast. SEO stands for Search Engine Optimization. You need to follow googles guidelines:
https://support.google.com/webmasters/answer/35291?hl=en
You need to add certain tags and best practices to your website to get googles bots to scan your site better and get a better understanding of how your website is setup. #Guarav Genius also gave you a link to google webmasters which should provide some more information on how to get your website to show up on google.

What are dpuf (extension) files?

I have seen this extension in some urls and I would like to know what they are used for.
It seems odd, but I couldn't find any information about them. I think they are specific for some plug-in.
It seems to be connected to 'Share This'-buttons on the websites.
I found this page which gives a quite comprehensive explanation:
This tag is mainly developed for tracking the URL sharing on various Social Networks, so every time anyone copies your blog content there he gets the URL ending with #sthash and extension with .dpuf or .dpbs

Google results only showing links of my main page

I have a website futbolpasionatlanta.com that has several pages that are internally linked. For some reason my google results are all showing my linked websites but my main page the index.php or just www.futbolpasionatlanta.com is showing up in the results.
Any ideas what I can do to correct this?
Is is something I would change on in my head tag?
Thanks,
If you want to encourage Google to crawl and index deeper into your site you should try to get incoming links directly to those inner pages. The higher the quality of those links, the better.
It might take some time before Google index your whole site. It will eventually be indexed.
For your next question:
You can use robots.txt and submit it in webmaster tools if you haven't done that yet. This can be used to block some pages:
http://support.google.com/webmasters/bin/answer.py?hl=en&answer=156449

HTML: How to get my subpages listed on a google search

When you go to Google and perform a search, it will return either one of two type of results:
just the title of your webpage, or
the title of your web-page plus, lists subpages it found on that web site
Here is an example of option #2: http://37assets.s3.amazonaws.com/svn/grub-ellis-googlelisting.png
My website on a google.com search only lists my web page title (option #1), how do I get google to list my subpages on the search results (option #2)?
Is is an HTML issue? How do I get Google to know what my subpages are so that it can also list those on a google search.
Those are called "sitelinks" and are automated but you can partially configure them in Google's webmaster's tools. In webmaster's tools, click "sitelinks" in the navigation menu on the left. From the sitelinks page:
Sitelinks are links to a site's interior pages. Not all sites have sitelinks. Google generates these links automatically, but you can remove sitelinks you don't want.
Here is another Google page explaining sitelinks.
You should add a site-map using the Google webmaster tools site, or by maintaining your own. For explanation check out Sitelinks page.
Google has not generated any sitelinks
for your site. Sitelinks are
completely automated, and we show them
only if we think they'll be useful to
the user. If your site's structure
doesn't allow our algorithms to find
good sitelinks, or we don't think that
the sitelinks are relevant to the
user's query, we won't show them.
However, we are always working to
improve how we find and display
sitelinks.
You can also directly enable sitelinks (you don’t have to get lucky) in Google’s Pay-Per-Click platform (AdWords), and it will have a similar very positive impact on your clickthrough rate.
You need to create XML sitemap. Here is all you need to know. Check if your open-source CMS has plugin/add-on/module to do this automatically, there must be generators somewhere too.
http://www.google.lv/search?q=XML+sitemap
http://en.wikipedia.org/wiki/Sitemaps
http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156184
You are describing "Search Engine Optimization" with your question. If you have a small site, the best thing you can hope for is to ensure every page has a unique title, links back to your home page, you have a good "site map" so search engines can easily discover ALL of your pages, and most important, your pages are THE definitive place for information about whatever you're selling.
Content is king and once you become the authority, your page will pop up in the 1st 1-2 links.
Contact some local SEO folks in your area and ask for a site evaluation. Many will do it for free with their automated tools. You can use the webmaster tools from bing or google if you're on a tight budget.