The SGP package is open source software built for the R software environment. Initially released on October 17th, 2008, the package is under active development on GitHub by Damian Betebenner and Adam Van Iwaarden. The classes, functions and data within the SGP package are used to calculate student growth percentiles and percentile growth projections/trajectories using large scale, longitudinal education assessment data. Quantile regression is used to estimate the conditional density associated with each student’s achievement history. Percentile growth projections/trajectories are calculated using the derived coefficient matrices and show the percentile growth needed to reach future achievement targets.
SGP analyses are designed to be performed (broadly) in a simple two step process:
Like with all data analysis, the bulk (> 90%) of the time is spent on data preparation. Once data is prepared correctly, analyses conducted using the SGP are intended to be simple and straightforward. Analyses that we assist on all follow this two step process. Running SGP calculations is designed to be as simple as possible. The following sections provide basic details to get you started with the required software/hardware, data preparation and data analysis.
As a package built for the R software environment, to use the SGP package one needs a computer with R installed. R is available for Window, OSX, and Linux and, being open source, can be compiled for just about any operating system. Running SGP analyses assumes some familiarity with using R. If you’re new to R, you might want to spend some time familiarizing yourself with it before diving into running SGP analyses. There are several resources available on CRAN for getting started with R.
After installing R, one needs to install the SGP
package. Because the SGP package utilizes several other
R packages, those will be installed concurrently.
Installing R packages can be done in several different
ways depending upon whether one is using a development environment for
R (e.g., RStudio), the
installed R client, or calling R from
the command line. In all cases, packages can be installed from the
R command prompt (>) as follows:
To install the latest stable release of SGP from CRAN
To install the development release of SGP from GitHub:
There a two formats for representing longitudinal (time dependent) student assessment data: WIDE and LONG format. For WIDE format data, each case/row represents a unique student and columns represent variables associated with the student at different times. For LONG format data, time dependent data for the student is spread out across multiple rows in the data set.
In general, the lower level functions in the SGP package that do the
calculations, studentGrowthPercentiles and
studentGrowthProjections require WIDE formatted data
whereas the higher level functions (wrappers for the lower level
functions) require LONG data. If running anything but the basic
analyses, we recommend setting up your data in the LONG format as much
of the capability of the package is built around the user supplying data
in that way.
To assist in setting up your data, the SGPdata package, installed when one installs the SGP package, includes exemplar WIDE and LONG data sets (sgpData and sgpData_LONG, respectively). Examples in the vignettes as well as validation tests done within the SGP package use these exemplar data sets. Detailed vignettes on WIDE and LONG data preparation are available at:
Following proper data preparation, SGP analyses are straightforward. In almost all cases, any errors that come up in data analysis usually revert back to data preparation issues so that there is usually some back and forth between data preparation and analysis.
If you have a topic for an SGP vignette that you’d like to see, don’t hesitate to write or set up an issue on GitHub.