A few days ago, @mozilla sacked 250 of the best FOSS technologists on this planet arguing serious budget constraints.
Today, we learn that Mozilla signed a three-year, $1.2+ billion deal with Google, and expected that to remain under wraps until November.
Despite of the love we have for the work they've done for decades the Mozilla organisation is insulting the FOSS community and its own amazing employees.
Not a good show, @mozilla.
BS4 always felt awkward to me. Doing scraping in JS is more natural. https://gitlab.com/philn/fb2feed/-/blob/master/fb2feed/data/fb-scraper.js
Productive week-end on the Fluy/fb2feed front. Scraping is now done as injected JS scripts, added UserMessage support and SHM as well. https://gitlab.com/philn/fluyt
@aperezdc I used a VM, it took a few days (not 15) back then (2 years ago) and I don't even remember what kind of details they ask :) Perhaps rubbish, but it's a one-time thing anyway.
Why NIH PhantomJS? Well PhantomJS is un-maintained, likely plagued with security bugs, and Qt based anyway, which I don't want on my server. WPE is a perfect fit for this project.
A Flatpak repo is provided for convenience. Tested on a Debian Stretch box (x86_64) with Flatpak 1.2 from stretch-backports. Works like a charm!
So, I use WPEWebKit, with an headless backend and a web-extension that fakes a user interaction (scroll) and waits a second before dumping the HTML body to the stdout.
I'll use this for a while and spin a 0.4 release soon, hopefully! Testers welcome!
Igalian / Free software hacker / pronoun.is/he
The social network of the future: No ads, no corporate surveillance, ethical design, and decentralization! Own your data with Mastodon!