Variable type: numeric
| skim_variable | n_missing | complete_rate | mean | sd | p0 | p25 | p50 | p75 | p100 | hist |
|---|---|---|---|---|---|---|---|---|---|---|
| Community | 9 | 0.89 | 216.12 | 339.90 | -402 | -40.5 | 169.5 | 444.25 | 1000 | ▃▇▅▃▂ |
| Hospital | 0 | 1.00 | 283.38 | 359.94 | -540 | -1.0 | 292.0 | 525.00 | 1031 | ▁▇▇▇▃ |
MATH 221
May 2, 2024
There are 85 rows and 2 columns. The data source1 is used to create our data that is stored in our pins table. You can access this pin from a connection to posit.byui.edu using hathawayj/copd_rehab.
This data is available to all.
Variable type: numeric
| skim_variable | n_missing | complete_rate | mean | sd | p0 | p25 | p50 | p75 | p100 | hist |
|---|---|---|---|---|---|---|---|---|---|---|
| Community | 9 | 0.89 | 216.12 | 339.90 | -402 | -40.5 | 169.5 | 444.25 | 1000 | ▃▇▅▃▂ |
| Hospital | 0 | 1.00 | 283.38 | 359.94 | -540 | -1.0 | 292.0 | 525.00 | 1031 | ▁▇▇▇▃ |
NULL
library(tidyverse)
library(pins)
library(connectapi)
copd_rehab <- read_csv('https://github.com/byuistats/data/raw/master/COPD-Rehab/COPD-Rehab.csv')
# Publish the data to the server with Bro. Hathaway as the owner.
board <- board_connect()
pin_write(board, copd_rehab, type = "parquet", access_type = "all")
pin_name <- "copd_rehab"
meta <- pin_meta(board, paste0("hathawayj/", pin_name))
client <- connect()
my_app <- content_item(client, meta$local$content_id)
set_vanity_url(my_app, paste0("data/", pin_name))This data is available to all.
Direct Download: copd_rehab.parquet
For public data, any user can connect and read the data using pins::board_connect_url() in R.
library(pins)
url_data <- "https://posit.byui.edu/data/copd_rehab/"
board_url <- board_connect_url(c("dat" = url_data))
dat <- pin_read(board_url, "dat")Use this custom function in Python to have the data in a Pandas DataFrame.
import pandas as pd
import requests
from io import BytesIO
def read_url_pin(name):
url = "https://posit.byui.edu/data/" + name + "/" + name + ".parquet"
response = requests.get(url)
if response.status_code == 200:
parquet_content = BytesIO(response.content)
pandas_dataframe = pd.read_parquet(parquet_content)
return pandas_dataframe
else:
print(f"Failed to retrieve data. Status code: {response.status_code}")
return None
# Example usage:
pandas_df = read_url_pin("copd_rehab")Our connect server is https://posit.byui.edu which you assign to your CONNECT_SERVER environment variable. You must create an API key and store it in your environment under CONNECT_API_KEY.
Read more about environment variables and the pins package to understand how these environment variables are stored and accessed in R and Python with pins.
National Heart Lung and Blood Institute, National Institutes of Health, U.S. Department of Health & Human Services. What is chronic obstructive pulmonary disease (COPD)? Website, March 2009. http://www.nhlbi.nih.gov/health/dci/Diseases/Copd/Copd_WhatIs.html. J. C. Waterhouse, S. J. Walters, Y. Oluboyede, and R. A. Lawson. A randomised 2x2 trial of community versus hospital pulmonary rehabilitation for chronic obstructive pulmonary disease followed by telephone or conventional follow-up. Health Technology Assessment, 14(6), 2010.↩︎