YouTube is probably guessing which language to use based on your IP address. You can request a preferred language by sending the accept-language header with your request:

accept-language:en-US,en;q=0.8
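
For illustration, here is a minimal Python sketch of where that header goes in a raw HTTP request. The function name `build_request` and the host and path values are placeholders of mine, not anything from your script; header names are case-insensitive, so the capitalised spelling is equivalent to the one above.

```python
def build_request(host: str, path: str = "/",
                  language: str = "en-US,en;q=0.8") -> bytes:
    """Build a minimal HTTP/1.1 GET request that states a preferred language.

    `host` and `path` are hypothetical placeholders for whatever page you fetch.
    """
    lines = [
        f"GET {path} HTTP/1.1",
        f"Host: {host}",
        f"Accept-Language: {language}",  # the language preference header
        "Connection: close",             # ask the server to close when done
        "",                              # these two empty entries produce the
        "",                              # blank line that ends the headers
    ]
    return "\r\n".join(lines).encode("ascii")


if __name__ == "__main__":
    print(build_request("www.youtube.com").decode("ascii"))
```

The resulting bytes are what you would write to the socket once it is connected. The server is free to ignore the preference, so treat it as a request rather than a guarantee.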

As for parsing, you may find it easier to save the entire response to a file before working with it. That lets you reuse (pretty much) the same socket events for every page instead of writing duplicate events for each page you want.
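
A rough sketch of the save-first approach, again in Python and under a few assumptions: the socket is already connected and the request already sent, the server closes the connection when it is finished, and the names `save_response` and `split_response` are just illustrative. It does not handle chunked or compressed bodies.

```python
import socket


def save_response(sock: socket.socket, out_path: str,
                  chunk_size: int = 4096) -> int:
    """Read from sock until the peer closes, writing every byte to out_path.

    Returns the number of bytes written. The same function works for any
    page, because it never looks at the content.
    """
    total = 0
    with open(out_path, "wb") as out:
        while True:
            chunk = sock.recv(chunk_size)
            if not chunk:  # empty read: the peer closed the connection
                break
            out.write(chunk)
            total += len(chunk)
    return total


def split_response(out_path: str) -> tuple:
    """Split a saved HTTP response into (raw headers, body) as bytes."""
    with open(out_path, "rb") as f:
        raw = f.read()
    # Headers end at the first blank line (CRLF CRLF).
    head, _, body = raw.partition(b"\r\n\r\n")
    return head, body
```

The point of the split is that the read loop stays generic, and only the parsing step that runs on the saved file needs to know which page it is dealing with.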