Is there a single, full-proof method for determining a site's home page when all you know is the domain?
I'm frequently faced with this challenge, but sometimes find exceptions. It sounds easy enough, but you don't always know if the site uses http or https, defaults to www subdomain, or even requires a full file name rather than just a slash. My usual approach is to assume the site has proper redirects in place to do the work for me, so this works most of the time:
UnshortUrl( UrlProperty($A1, "absolute") )
Lately, I come across some sites that appear to prevent simple browsing from anything except legitimate desktop or mobile browsers. For example, here's what the above formula renders for these domains:
footlocker.com → Error:403 Forbidden http://www.footlocker.com/
dickssportinggoods.com → http://www.dickssportinggoods.com/UnsupportedBrowserErrorView?langId=-1&storeId=15108
networksolutions.com → timeout.html'
I've tried changing my User-Agent in GlobalSettings config, as well as various combinations of CollectCookies, IntervalBetweenRequests, and Accept-Language. Perhaps there is a specific Request Header I can set to spoof the server into believing I'm a real browser?
I'd appreciate any suggestions from the SeoTools Community.