Embedding Facebook Media or Video in your blog posts? We discovered today that those embedded media items come with meta robots.txt block. This meta robots block will greatly effect your overall post seo score.
Updated 12/13/16: Facebook embedded Website Comments also comes with an undisclosed robots block.
What Facebook is saying: You can use this resource to display on your site, but we block any seo advantage to using it. Just like all external Facebook links are meta nofollow. This means putting a link to your website or blog will do no good seo wise, though you might get a few clicks. It also should be noted any outside link clicks must be approved by Facebook Link Shim. 🙁
From Google Webmaster tools help:
If a site’s robots.txt file disallows crawling these resources, it can affect how well Google renders and indexes the page, which can affect the page’s ranking in Google search.
This report shows resources used by your site that are blocked to Googlebot. Not all resources are shown, only resources that we think might be under your control.
The report landing page shows a list of hosts that provide resources on your site that are blocked by robots.txt rules. Some resources will be hosted on on your own site, and some will be hosted on other sites.
Click on any host in the table to see a list of blocked resources from that host, with a count of pages on your site affected by each blocked resource.
Click on any blocked resource in the table for a list of your pages that load the resource.
Click on any page in the table hosting a blocked resource for instructions on how to unblock that resource, or else follow the instructions below under How to unblock your resources.
To unblock your resources:
Open the Blocked Resources Report to find a list of hosts of blocked resources on your site. Start with the hosts that you own, since you can directly update the robots.txt files, if needed. You might not have control of all the hosts that we list, but change the ones that you can.
Click a host on the report to see a list of blocked resources from that host. Go through the list and start with those that might affect the content and layout in a meaningful way. Less important resources, such as tracking pixels or counters, aren’t worth bothering with.
For each resource that affects layout, click to see a list of your pages that uses it. Click on any page in the list and follow the pop-up instructions for viewing the difference and updating the blocking robots.txt file. Fetch and render your page after each change to verify that the resource is now appearing.
Continue updating resources for a host until you’ve enabled Googlebot access to all the important blocked resources.
Move on to hosts that you don’t own, and if the resources have a strong visual impact, either contact the webmaster of those sites to ask them to consider unblocking the resource to Googlebot, or consider removing your page’s dependency on that resource.