Paired Ends
Subscribe
Sign in
Home
R
Python
Papers
TIL
AI
Archive
About
R
Latest
Top
R 4.5.0 and Bioconductor 3.21
Faster package installation, import only the functions you want with use(), built-in Palmer penguins data, grep values shortcut, and lots of new…
Apr 17
•
Stephen Turner
Exploring the bioRxiv API with R, httr2, rvest, tidytext, and Datawrapper
Collect metadata and publication details for >200k preprints over a 10 year period, investigate trends, and scrape full text for sentiment analysis
Feb 10
•
Stephen Turner
Bluesky conversation analysis with local and frontier LLMs with R/Tidyverse
Summary, translation, and sentiment analysis of Bluesky posts on a topic using atrrr, tidyverse/ellmer, and mlverse/mall with Claude and other open…
Dec 30, 2024
•
Stephen Turner
Use an LLM to translate help documentation on-the-fly
The lang package overrides the ? and help() functions in your R session. If you are using RStudio or Positron, the translated help page will appear in…
Dec 14, 2024
•
Stephen Turner
Turn a GitHub repo into a single text file for LLM-friendly input
The open source repo2txt web app will take a GitHub repo URL, displays the directory structure, lets you choose which files to include, and provides a…
Dec 9, 2024
•
Stephen Turner
TIL: dplyr::mutate()'s .keep argument
Use the .keep argument in dplyr::mutate() to control which variables to retain after mutating to create new variables.
Dec 2, 2024
•
Stephen Turner
Tech I'm thankful for (2024)
Data science and bioinformatics tech I'm thankful for in 2024: tidyverse, RStudio, Positron, Bluesky, blogs, Quarto, bioRxiv, LLMs for code, Ollama…
Nov 25, 2024
•
Stephen Turner
Expand your Bluesky network with R
Find people followed by the people you follow, but who you don't follow, using R and the atrrr package
Nov 20, 2024
•
Stephen Turner
Bluesky for Science
Bluesky starter packs for genomics, bioinformatics, R, and Nextflow
Nov 16, 2024
•
Stephen Turner
Python for R users
A Google search for “R vs Python” returns thousands of hits across sites like Reddit, IBM, Datacamp, Coursera, Kaggle, and many others.
Oct 21, 2024
•
Stephen Turner
Use nanoparquet instead of readr/CSV
Parquet is interoperable between Python and R, fast to read+write, works well with databases, and stores complex data types (e.g., tibble listcols). Use…
Oct 8, 2024
•
Stephen Turner
DuckDB vs dplyr vs base R
Using DuckDB in R to analyze 100 million rows of data in 3 seconds
Oct 7, 2024
•
Stephen Turner
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts