0
down vote
favorite
So you have these Microsoft KB support articles. Example: https://support.microsoft.com/en-us/help/871122
My goal is to be able to iterate through a list of these pages and see if the articles actually exist. As you can imagine, it's hard to do it manually, so I need a program to do that for me. I have tried loading a page with .NET HTML Agility Pack and also some PHP code posted below. Both result in a broken and useless page. Namely, this is what I get: https://i.stack.imgur.com/mKoED.png
Any ideas on why this is happening and what's something I can do to fix this?
Here is some PHP code I have tried.. nothing too special.
$url = 'https://support.microsoft.com/en-us/help/871122';
$htm = file_get_contents($url);
echo $htm;
Thanks!