I have a script that concatenates two HTML files into one. It literally just inserts the 2nd HTML code right after the first.
htmlfile1 = urllib.urlopen(url1) htmlfile2 = urllib.urlopen(url2) htmltext1 = htmlfile1.read() htmltext2 = htmlfile2.read() name=symbolslist[i]+'.html' o=open(name, "w") o.write(htmltext1) o.write(htmltext2) o.close()
In my other thread I seem to be having trouble parsing information on the 2nd HTML part using bs4, when the solution is correct.
I have no issue parsing information on the first HTML.
thread: beautifulsoup parsing - dealing with superscript?
Therefore I was wondering if Beautiful Soup works or not on concatenated HTML.