pyindex_domains Documentation can be found here To do better optimization for faster parallel processing needed (also processes are dieing)... better network connexion or using RPyC - Transparent, Symmetric Distributed Computing on cloud parse html from site also issue: gethostbyname() does not support IPv6 name resolution, and getaddrinfo() should be used instead for IPv4/v6 dual stack support.