Hi Igniters,

 

We have completed the SEO audit of the Ignite website which was announced
earlier this year. Many issues were identified and most have been resolved.
There are some open issues that represent opportunities to further improve
the website, some of which we will continue to pursue.

 

For those who would like to know what we found, what we fixed, what is still
open, and the recommended steps for maintaining the site going forward, I
present the following summary of the SEO audit findings:

 

Identified Issues and Recommended Resolutions

 

1 - Canonical Issues

These two canonical issues contributed to Google being able to see and index
full duplicate versions of nearly every page on the site. For that reason,
this issue had the highest severity. 

 

Both the HTTP and HTTPS version of the site were resolving and neither had
canonicals placed on them.

*         Recommendation: Ignite uses the HTTPS version as the canonical
version of the site, and redirect the HTTP version to the newly chosen
canonical version of the site. - Completed 

URLs are lacking canonical tags site wide

*         Recommendation: Add canonical tags (mainly self-referencing tags)
to all pages that Ignite wants to be indexed. - Completed 

2 - URL Crawl Bloat

The API documentation URLs accounted for the largest page directory on
ignite.apache.org. Most of these pages were orphaned, but Google had found
and indexed around 12,000 of them (likely from articles or webinar links).
These pages are nearly identical across various releases with only small
changes, causing significant confusion for Google to decide which version to
serve. 

*         Recommendation -  Noindex all versions except for the latest
version of the documentation. In order to allow these documentation pages to
be readily accessible to visitors who may want them but not contributing to
any duplicate content issues on the site, Ignite should allow users to
navigate to older documentation without Google spending too much time
crawling and indexing so many of those pages. This also creates a clean
landing page structure for the cases where someone is looking for current
useful documentation.  - Completed 

3 - Meta Tags

Meta Descriptions- No meta descriptions were found on any pages. 

*         Recommendation: Determine which pages need meta descriptions and
create them. Meta descriptions should be unique and created to draw users in
from the SERPs. - Not Started

H1s

*         Most pages do not have H1 tags. H1 tags have a limited effect on
SEO, but have a strong effect on user experience metrics like bounce rate.


o    Recommendation: For every non-documentation page, take the existing H2s
and turn them into optimized H1 tags. To take it a step further, optimize
these new H1s so that they accurately describe the content of each page.  -
Completed 

*         Visitors who land directly on a feature
<https://ignite.apache.org/features/datagrid.html>  or use cases
<https://ignite.apache.org/use-cases/caching/database-caching.html>  page
from Google may not know that the page is a feature or use cases page.

o    Recommendation: Communicate to the visitor that they are on a feature
or a use cases page via the H1 or an H2.

Title tag

*         The ignite.apache.org homepage needs a new title tag. Seeing as
title tags do have a minor effect on a page's ranking ability, a title tag
of "Apache Ignite" is not descriptive and will not rank for any head terms. 

o   Recommendation: It would be useful to add "open source" and "in-memory
data memory fabric" terminology. - Completed 

*         Title tags site wide are generally poor. 

o   Recommendation: Title tags should be reviewed on a page by page level
and optimized so that they include key terms and accurately describe the
page. It is recommended that the most important keyword is to the left most
position of the title tag. For example, the title tag for the news page went
from "Apache Ignite - News" to "Latest News - Apache Ignite". - Completed 

 

4 - Sitemap

The XML sitemap was missing many pages and contained some non-important
pages. 

*         Recommendation: Update the sitemap to include all important URLs
and remove pages that 404 or pages that shouldn't be submitted to Google. -
Completed 

 

5 - Homepage Links

Some of the text on the homepage uses images rather than regular body text.

*         Recommendation: Use regular body text instead of an image. A user
should be able to copy and paste this text. This allows Googlebot to
understand what the text is about. - Completed 

 

6 - Landing Page Content 

Content on most landing pages is light - most landing pages have a lot of
potential, but generally contain one image, two paragraphs and then a large
glossary of related terms that links out to the readme.io documentation.

*         Recommendation: Deepen the level of content available about the
various features and use cases on the actual page. The glossary of features
and the related descriptions is helpful, but sharing details about those
features with some context would be much more helpful for a user on the
page.  - Not Started 

7 - Related Domains

A few content issues on ignite.apache.org and related domains were
identified. 



Apacheignite.gridgain.org 

*         Has a variety of content duplicative to that on
apacheignite.readme.io. There might also be an issue with hosting all the
ignite content on a Gridgain domain. (Note: this was not gridgain.com, but
gridgain.org)

*         Recommendation: Remove this site entirely. - Redirected links on
gridgain.org to deprecate the site

apacheignite.readme.io

*         There is a lot of content on the readme.io site that could be
beneficial to have on the ignite.apache.org site.

o    The drawback of readme.io is that it is not on Ignite's domain.

o    This site may compete with ignite.apache.org on key terms

o    Has content duplicative to that on ignite.apache.com

*         Recommendation: Consider using some of this content to improve
some of the top-level landing pages on the ignite site.   - Not Started

 

8 - Blog Content

Blog content lives off-site - The blog content related to apache ignite is
generally posted on other sites with the blog page linking out to them. Some
blog posts are on the apache.org domain (ex:
https://blogs.apache.org/foundation/entry/the-asf-asks-have-you) but no blog
posts are on the ignite.apache.org domain. - No Resolution Possible

 

 

Recommended Ongoing Performance Checks

 

The following metrics should be monitored on an ongoing basis: 

 

*         Indexation: Perform a site query search in Google to see how many
of your pages are in Google's index. For example, run site:ignite.apache.org

*         Backlinks: You can run your backlinks through Ahrefs, Majestic
and/or Open Site Explorer to make sure spammy sites are not linking to you
excessively. You can also download backlinks from Google Search Console

o   NOTED: you can run basic backlinks from these tools for free, but to get
a comprehensive view, the paid version would be required 

*         Monitor Google Web Master Tools: 

o   Alerts: Make sure someone is setup to receive alerts when there are new
messages. This is where manual penalties would be sent from Google. 

o   Search appearance > structured data and HTML improvements: monthly
frequency

*  HTML improvements: Look through the duplicate title tags - the most
important 

*  Look through the 1st page of errors on a monthly basis and go deeper if
more bandwidth is available 

*         Google index: 

o   Index status: monthly basis 

o   Content KWs: 3 - 6 months 

o   Crawl errors: monthly 

o   Crawl stats: monthly

 

As always, please feel free to contact me with any questions or comments.

 

Best,

 

Terry 

 

Terry Erisman

[email protected] <mailto:[email protected]> 

 

 

 

Reply via email to