Skip to content

2016-03-25 validation data issues (depth of water temp) #88

@jordansread

Description

@jordansread

related to #87 , but we have seemingly errant depth data the is coming from the Water Quality Portal.

After transforming the units, here are the distributions of the data:

depth.units   p.25   p.50     p.75  count
        (chr)  (dbl)  (dbl)    (dbl)  (int)
1        feet 1.8288 5.4864 10.02792  57365
2          ft 0.9144 0.9144  1.82880  93566
3          in 0.2540 0.3810  0.45720     27
4           m 1.0100 4.0590  9.00000 730459
5      meters 3.0000 8.0000 16.00000  53368
6          mm 0.0130 0.0130  0.01300      1

note that the depth.units is the original unit, and the values in the p.25, p.50, etc are the transformed measurements, all in m

A couple of things could be going on here:

  1. certain agencies/groups that use certain units may have errant data in here
  2. certain groups may use a certain unit (ft for example) and collect mostly surface measurements
  3. bad data is spread across agency identifiers and units randomly, or follows some kind of pattern we can use to ultimately make a judgement call to remove certain data.

Will be updating this issue w/ progress.

@lawinslow noted this difference between data coming from the WI DNR, and the WI section of the WQP:
WQP
image

DNR
image

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions