Mapping Tornado Alley with R

Mapping Tornado Alley with R

| | cartography, ggplot, maps, R
I caught a re-tweet of this tweet by @harry_stevens: THREAD: I wrote a post on @observablehq about a map I made today. It shows a typical day in the life of a graphics journalist: You never know what problems you'll have to solve on deadline! https://t.co/yRhW1wbLxN #d3js #dataviz 1/7 pic.twitter.com/7N6mmK0nz3 ... Read More
🔗

Quick Hit: Updates to QuickLookR and {rdatainfo}

| | macos, R
I’m using GitUgh links here b/c the issue was submitted there. Those not wishing to be surveilled by Microsoft can find the macOS QuickLook plugin project and {rdatainfo} project in SourceHut and GitLab (~hrbrmstr and hrbrmstr accounts respectively). I hadn’t touched QuickLookR🔗 or {rdatainfo}🔗 at all since 2016 since it ... Read More
Quick Hit: Scraping javascript-“enabled” Sites with {htmlunit}

Quick Hit: Scraping javascript-“enabled” Sites with {htmlunit}

| | R, web scraping
I’ve mentioned {htmlunit} in passing before, but did not put any code in the blog post. Since I just updated {htmlunitjars} to the latest and greatest version, now might be a good time to do a quick demo of it. The {htmlunit}/{htmunitjars} packages make the functionality of the HtmlUnit Java ... Read More
datacamp

A Note to Our Community On How To Hide Your Content From Search Engines

| | Leadership
Say your organization has done something pretty terrible. Terrible enough that really didn’t want to acknowledge it initially but eventually blogged about it, and haven’t added a blog post in a long time so that entry is at the top of your blog index page which Google can still index ... Read More
Lost In [SQL] Translation: Charting d[b]plyr Mapped SQL Function Support Across All Backends

Lost In [SQL] Translation: Charting d[b]plyr Mapped SQL Function Support Across All Backends

| | dplyr, R
Like more posts than I care to admit, this one starts innocently enough with a tweet by @gshotwell: Is there a reference document somewhere of which dplyr commands work on various database backends? #rstats — Gordon Shotwell (@gshotwell) April 9, 2019 Since I use at least 4 different d[b]plyr backends ... Read More
A Limited-but-Functional Couchbase Free Text Search & Retrieval Un-package; or, “How I Abused Couchbase & R to Perform Bulk IP Whois Full-text Searches” (a Cobbler’s Tale)

A Limited-but-Functional Couchbase Free Text Search & Retrieval Un-package; or, “How I Abused Couchbase & R to Perform Bulk IP Whois Full-text Searches” (a Cobbler’s Tale)

| | R
Researching “the internet” (i.e. $DAYJOB) means having to deal with a ton of “unique” (I’m being kind) data formats. This is ultimately a tale of how I performed full-text searches across one of them. It all started off innocently enough. This past week I need to be able to do ... Read More
Better Anomaly Detection with the T-Digest #WhiteboardWalkthrough

Wicked Fast, Accurate Quantiles Using ‘t-Digests’ in R with the {tdigest} Package

| | R
@ted_dunning recently updated the t-Digest algorithm he created back in 2013. What is this “t-digest”? Fundamentally, it is a probabilistic data structure for estimating any percentile of distributed/streaming data. Ted explains it quite elegantly in this short video: Said video has a full transcript as well. T-digests have been baked ... Read More
💙

Rome Was Not Built In A Day But widgetcard Was!

| | R
I saw a second post on turning htmlwidgets into interactive Twitter Player cards and felt somewhat compelled to make creating said entities a bit easier so posited the following: Wld this be useful packaged up, #rstats?https://t.co/sfqlWnEeJVhttps://t.co/troKzmzTNv (TLDR/V: Single function to turn an HTML widget into a deployable interactive Twitter card) ... Read More

Assumptions Matter More Than Dependencies

| | R
There’s been alot of talk about “dependencies” in the R universe of late. This is not really a post about that but more of a “really, don’t do this” if you decide you want to poke the dependency bear by trying to build a deeply flawed model off of CRAN ... Read More

Collecting Content Security Policy Violation Reports in S3 (‘Effortlessly’/’Freely’)

| | Cybersecurity, R
In the previous post I tried to explain what Content Security Policies (CSPs) are and how to work with them in R. In case you didn’t RTFPost the TLDR is that CSPs give you control over what can be loaded along with your web content and can optionally be configured ... Read More