There is a newer version of this record available.

Software Open Access

vtreat: A Statistically Sound 'data.frame' Processor/Conditioner

Mount, John; Zumel, Nina

A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). 'vtreat::prepare' should be used as you would use 'model.matrix'.

Files (2.7 MB)
Name Size
vtreat_1.0.2.tar.gz
md5:a48fc2ceb484177b21f383020bb4b739
1.3 MB Download
vtreat_1.0.3.tar.gz
md5:39e3ffe6fbae2234a3b362f7c5448e06
1.3 MB Download
37
0
views
downloads
All versions This version
Views 3710
Downloads 00
Data volume 0 Bytes0 Bytes
Unique views 3610
Unique downloads 00

Share

Cite as