Subset selection may produces lower prediction error, however, because it is a discrete - variables are either retained or discarded - it often exhibits high variance. Shrinkage methods are more continuous, and don’t suffer as much from high variability.