Related to #87, but we have seemingly errant depth data that is coming from the Water Quality Portal.
After transforming the units, here are the distributions of the data:
  depth.units   p.25   p.50     p.75  count
        (chr)  (dbl)  (dbl)    (dbl)  (int)
1        feet 1.8288 5.4864 10.02792  57365
2          ft 0.9144 0.9144  1.82880  93566
3          in 0.2540 0.3810  0.45720     27
4           m 1.0100 4.0590  9.00000 730459
5      meters 3.0000 8.0000 16.00000  53368
6          mm 0.0130 0.0130  0.01300      1
Note that depth.units is the original unit, and the values in p.25, p.50, and p.75 are the transformed measurements, all in m.
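For anyone wanting to reproduce this kind of summary outside of R, here is a minimal Python sketch of the transform-then-summarize step. It is not the code that produced the table above: the function names, the `(value, unit)` record layout, and the sample values are all made up for illustration, and the unit spellings are simply copied from the depth.units column.

```python
from collections import defaultdict
from statistics import quantiles

# Conversion factors to metres. The keys mirror the spellings seen in
# depth.units; anything else would raise a KeyError rather than pass silently.
TO_METERS = {
    "feet": 0.3048,
    "ft": 0.3048,
    "in": 0.0254,
    "m": 1.0,
    "meters": 1.0,
    "mm": 0.001,
}


def to_meters(value, unit):
    """Convert a depth in its original unit to metres."""
    return value * TO_METERS[unit]


def summarize_by_unit(records):
    """Quartiles and counts of depth (in m), grouped by the ORIGINAL unit.

    `records` is a hypothetical iterable of (value, unit) pairs.
    """
    by_unit = defaultdict(list)
    for value, unit in records:
        by_unit[unit].append(to_meters(value, unit))

    summary = {}
    for unit, depths in by_unit.items():
        if len(depths) == 1:
            # A single observation has no spread; report it for all quartiles.
            q = [depths[0]] * 3
        else:
            # "inclusive" interpolates between observed values only.
            q = quantiles(depths, n=4, method="inclusive")
        summary[unit] = {"p25": q[0], "p50": q[1], "p75": q[2], "count": len(depths)}
    return summary


# Made-up sample, not real WQP data.
sample = [(3, "ft"), (3, "ft"), (6, "ft"), (13, "mm"), (1.0, "m"), (4.0, "m"), (9.0, "m")]
summary = summarize_by_unit(sample)
```

Grouping by the original unit after converting is what makes the table useful: every row is on the same scale, so differences between rows point at the reporting groups rather than at the units themselves.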
A couple of things could be going on here:
- certain agencies/groups that use certain units may have errant data in here
- certain groups may use a certain unit (`ft` for example) and collect mostly surface measurements
- bad data is spread across agency identifiers and units randomly, or follows some kind of pattern we can use to ultimately make a judgement call to remove certain data.
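One way to start telling these possibilities apart is to tabulate, per agency and original unit, what share of measurements are near the surface. The sketch below is only an illustration: the `(agency, unit, depth_m)` record layout, the agency names, and the 1 m cutoff are assumptions, not anything taken from the WQP data.

```python
from collections import defaultdict

# Hypothetical cutoff for what counts as a "surface" measurement, in metres.
SURFACE_CUTOFF_M = 1.0


def shallow_share(records, cutoff=SURFACE_CUTOFF_M):
    """Share of shallow measurements per (agency, original unit).

    `records` is a hypothetical iterable of (agency, unit, depth_m) tuples,
    where depth_m has already been converted to metres.
    Returns {(agency, unit): (share_shallow, n_total)}.
    """
    counts = defaultdict(lambda: [0, 0])  # [n_shallow, n_total]
    for agency, unit, depth_m in records:
        entry = counts[(agency, unit)]
        if depth_m <= cutoff:
            entry[0] += 1
        entry[1] += 1
    return {key: (n_shallow / n_total, n_total)
            for key, (n_shallow, n_total) in counts.items()}


# Made-up records for illustration only.
records = [
    ("AGENCY_A", "ft", 0.9144),
    ("AGENCY_A", "ft", 0.9144),
    ("AGENCY_A", "ft", 1.8288),
    ("AGENCY_B", "m", 4.0),
    ("AGENCY_B", "m", 9.0),
]
shares = shallow_share(records)
```

If a few agency/unit pairs show a very high shallow share with plausible counts, that would be consistent with surface-only sampling; suspicious values scattered evenly across pairs would lean toward the random bad-data case.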
Will be updating this issue w/ progress.
@lawinslow noted this difference between data coming from the WI DNR, and the WI section of the WQP:
WQP

