Trying to automate the print-to-PDF process.
Having a problem getting page data. A campus directory page allowed wget2 to download its pages fine for over 10 years. Now that the pages have been modified, I get nothing. I can open the new page https://employeessb-prod.ec.guamcc.edu/EmployeeSelfService/ssb/campusDirectory#/lastName in a browser, but manually saving the page results in a 1.3M file, while printing to PDF gives a 50K file. It seems to be some kind of dynamic page, so Save As gives a ton of garbage JavaScript code instead of the final web page. Printing to PDF seems to take the final rendered form and print the correct information.
Currently it takes 2 1/2 minutes to go to the first page, press Ctrl-P, and print to a PDF file, then use the Next option to go to pages 2-6 and manually print each one, changing the save file name each time. That makes about six 50K files. I use pdftotext to convert them to text, then combine them into one file to extract the data.
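The pdftotext and combine steps, at least, are easy to script. A minimal sketch, assuming the PDFs are saved in the current directory as directory-page1.pdf through directory-page6.pdf (made-up names, use whatever you actually save them as) and that pdftotext is on the PATH:

```python
import subprocess
from pathlib import Path


def pdf_to_text(pdf_path: Path) -> Path:
    """Convert one PDF to a .txt file next to it using pdftotext."""
    txt_path = pdf_path.with_suffix(".txt")
    # -layout keeps the columns roughly as they appear on the page.
    subprocess.run(["pdftotext", "-layout", str(pdf_path), str(txt_path)], check=True)
    return txt_path


def combine_text_files(txt_paths: list[Path], combined_path: Path) -> Path:
    """Concatenate the text files, in the order given, into one file."""
    with combined_path.open("w", encoding="utf-8") as out:
        for txt_path in txt_paths:
            text = txt_path.read_text(encoding="utf-8", errors="replace")
            out.write(text)
            # Make sure each page's text ends with a newline.
            if not text.endswith("\n"):
                out.write("\n")
    return combined_path


if __name__ == "__main__":
    # Hypothetical file naming: directory-page1.pdf ... directory-page6.pdf
    pdfs = sorted(Path(".").glob("directory-page*.pdf"))
    combine_text_files([pdf_to_text(p) for p in pdfs], Path("directory.txt"))
```

Plain sorting is fine for single-digit page numbers; with ten or more pages the names would need zero-padding to stay in order.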
What took 6 seconds to download with wget2 now takes 2 1/2 minutes??