If you ’ve ever played the gameGeoGuessr , you ’ll bonk how tough it is to locate somewhere purely by an image . The online game plunk you in a random localisation on Google Street View and it ’s your job to determine where you are anywhere in the world only using the visible clues . But even as spacial animate being , us humans find it tough to tell our Australian outback   from our South African bush .

Now , Google   has developed an artificial intelligence information machine capable of “ superhuman levels of truth ” when it come to guessing a location based on just the picture element of a   exposure .

Tobias Weyand , one of the brains at Google , and a team who specialize in produce “ computer visual sense ” have developed PlaNet –   a deep neural web up to of determining where an image was guide , purely on its pel .

Article image

To develop the computing machine ’s “ spacial memory ”   the squad fed in   91 million geotagged image . Using this data point , they   chopped up a map into a gridiron consist of over 26,000 squares , with the   size of the squares proportional to how many of the images had been shoot there . Big cities with highly concentrated population   be given to be made up of more square compared to rural arena . As such , there were n’t enough paradigm to admit the ocean or the icy regions .

Throughout this unconscious process , the cryptic neural connection picks up on vogue and visual cues that appear in photo in the same control grid such as color , texture , and shape . For example , the web could pick up on the distinctive red and dark-brown hues of the Grand Canyon ’s rock from the countless photographs of the area .

Some of PlaNet ’s guesses .   prototype credit : Google / Tobias Weyand et al .

To try out out the neural internet , they gave it   2.3 million geotagged photographs from Flickr to see if it could guess where the image was read . The reckoner correctly hazard the continent of 48 percent of images , and make up one’s mind the nation of 28.4 percent of them . On top of this , the   studyclaims it can " localize 3.6 pct of the persona at street - grade accuracy and 10.1 percent at city - level accuracy . ”

While this might not voice massively precise , it ’s a tidy sum better than humans . The squad put PlaNet up against   “ 10 well - travel human subjects ” in a game of GeoGuessr . satellite won 28 times out of 50 with an median mistake of 1,131.7 kilometers ( 703 geographical mile ) , while the man were off - target by an norm of 2320.75 kilometers ( 1442 miles ) .

Remarkably , the researchers claim PlaNet   even has the power to see the location of images of indoor environs by picking up on specific items , MIT Technology Reviewreports .

You might think humans would have an advantage , as the study explicate : “ In the absence of obvious and discriminative landmarks , humans can settle back on their creation cognition and use multiple cue to guess the placement of a pic .

“ The oral communication of street sign or the driving management of railway car can serve narrow down possible position . Traditional electronic computer vision algorithms typically lack this kind of reality noesis , relying on the feature film provided to them during education . ”

However , Weyand and co suggest PlaNet really has the advantage as it has been fertilise more mental image of places than any human could ever travel and larn to acknowledge pernicious cue stick from the jillion of different scenes .

you’re able to read a full write - up of their ontogenesis in a study inarXiv .

[ H / T : MIT Technology Review ]