Hi, Racket Discourse.
I would like to ask what the current state is of web scraping libraries in Racket, and whether it would be better to just roll my own.
I am thinking about building some functionality to help my team gather IOCs (indicators of compromise) and metadata regarding current malware campaigns and the like. We're quite a small team, so nothing ostentatious, but it would be nice to be able to gather the data and process it into queries/formats that our SIEM and EDR systems can understand, for example.
I can see a couple of salient resources:
But I've never worked with SXML
before, so I have no idea what to expect. I probably won't build a bona fide crawler, but perhaps something that can be run once when you come upon a site with some interesting information that you'd like to quickly scrape and process. Our EDR, SentinelOne, has a similar integration/plug-in called SentinelOne Hunter; however, I often find myself thinking that it's rather limited.
As I mention, this is still only a pipe-dream, so any advice or commentary would be welcome.
Thanks!