FASTEX Central Archive Ship Quality Controls


SHIPS Surface in-situ data: data set description page

  1. Overview
  2. The real time quality control processing
    , as recommended by the World Meteorological Organization (WMO), was automatically applied on operational data received at Meteo-France (in Toulouse, France) from the Global Telecommunication System (GTS), in WMO BUFR format. See the quality control procedures handbook for more information.

    But, for meteorological experiments analyses, and especially for case studies, a measurement can be very important in the interpretation. In these cases, the real time quality control processing appeared insufficient.

    Following that, the FCA decided to improve the validation of the commercial ships surface measurements data set. This includes:

  3. gross limit checks on several parameters,
  4. temporal and/or spatial consistency,
  5. when necessary, a visual examination of data.
  6. Here are the main problems found:

  7. coding error
  8. use of a wrong unit (whole hPa instead of tenths of hPa)
  9. error when coding the quadrant of the globe (used to determine latitude and longitude signs)
  10. error when coding sign (air temperature)
  11. latitude typed in place of longitude and vice versa
  12. typing error:
  13. inverted figures (i.e. 46 instead of 64)
  14. use of a neighbouring key (ELTY0 instead of ELTY9 )


  15. Data checking
  16. Foreword: to improve the validation of the commercial ships surface measurements data set, the FCA used several 'limit values' which are described in the next sections. They have two aims:
  17. to improve the real time gross limit checks by using limits more accurate for the domain of interest (in space and time)
  18. to find and examine not only bad values but also suspect values: these limits are not used as 'limits that can't be passed', but are defined as values such as most of the values above (or below, according to the case) appear as being suspect, and so need a closer examination.
    1. Gross limit checks
    2. As real time gross limits appeared too large for the FASTEX domain and period, the following values were used to complete the control processing. For each doubtful observation, data were visually examined and compared to neighbouring observations (in space and time). If a problem appeared, the wrong value was flagged and, in some cases, corrected (5: good, 6: maybe, 7: bad, 8: estimated); see the data correction section for more details.

      In the table: B = bad, Q = questionable, E = estimated, G = good, P = pressure, T = temperature, RH = relative humidity, FF = wind speed

      Parameter Gross Limit Check
      real time values (FASTEX domain and period)
      Gross Limit Check
      FCA values
      Parameter flagged Flag applied
      Mean Sea Level Pressure< 910 hPa or > 1080 hPa< 960 hPa or > 1040 hPaPG/Q/B/E
      Temperature <-30 C or >50C if latitude<45N
      <-80C or >35C if latitude>45N
      <-20C or > 30C T/RH G/Q/B/E
      Relative Humidity TD<-35C or >35C if latitude<45N
      TD<-85C or >30C if latitude>45N
      < 30% T/RH G/Q/B/E
      Wind Speed > 60m/s if latitude<45N
      >50m/s if latitude>45N
      > 35 m/s FF G/Q/B/E

      Note that as the humidity checks went on, several ships appeared as having lower humidities than neighbouring ships, but without any way to settle if these data were doubtful or not. Flags were not modified, but low humidity values must be used with care .

    3. WMO ship's call sign checks
    4. A number of messages arrived from the GTS with a bad ship's call sign. There are several reasons for that: typing error when the message was sent on the GTS, problem of transmission, use of the name 'SHIP' or of the date instead of the call sign. These erroneous call signs were found by looking at too short or too long call signs (they usually have between 4 and 8 letters), by looking at ships which broadcasted less than 5 messages during the FASTEX period, by looking at call signs which were not referenced in the WMO ship list.

      For each doubtful call sign, the message was compared to the other messages received in a spatio-temporal window of +/- 12 hours and +/- 3 degrees in latitude and longitude. The message was also compared to the messages sent by ships with closely related call signs. Some of these doubtful call sign were corrected in this way, without any uncertainty.

      Following that, it appeared that a number of messages were in fact corrective messages, but not identified as it because the call sign was not correct. Only the good ones were keeped in the FCA.

      The following table sums up the different cases:

      nb of processed messages cause action
      954 (0.55%)unknown call signstatu quo
      451 (0.26%)unknown call signcorrected into a known call sign
      2137 (1.2%)unknown call signcall sign='SHIP'
      820 (0.47%)duplicated transmissionsdeleted

    5. Ship trajectory consistency
    6. In the real time quality control processing, a ship trajectory consistency already exists. It was completed in delayed time by the FCA in the following way.

      Using parameters available in the SHIP messages, several checks were made on each ship trajectory:

    7. locations of a ship were used to calculate, between every two successive locations, an average speed over the period between these two locations ( < 48 hours).
    8. if the ship course speed was included in the message, and the difference between this average speed and the ship course speed was greater than 12 m/s, the location was checked.
    9. when the ship course speed was not included in the message, every average speed greater than 50 km/h was checked.
    10. the ship locations were also compared to the locations estimated using the previous location and the ship course heading and speed. When the distance between two locations was greater than a limit value, the ship locations, heading and speed were checked. This limit value depends on the ship speed, the time between two successive observations, the heading steadiness, and also includes the lack of precision due to the WMO coding itself, estimated in the following table:

      parameter precision in the SHIP message lack of precision in km
      latitude0.1 degree11 km at 50 degrees N
      longitude0.1 degree7 km at 50 degrees N
      ship heading360 degrees / 8 sectors=45 degrees
      1. speed ~ 15km/h, steady heading, 6h interval: ~48km,
      2. speed ~ 25km/h, steady heading, 6h interval: ~66km
      ship speedby 5 kt intervals

      We can see that the error just due to the coding appears to be quite large. Nevertheless, this kind of control allowed to find several wrong ship positions. As an example, the following limits were used in this processing:

      time between 2 messages ship speed code ship speed limit value
      6 hours215km/h90km
      6 hours325km/h130km
    11. Each doubtful location was visually checked, flagged if necessary and, in some cases, corrected (see the data correction section for more details).

    12. Land/Sea mask
    13. A new land/sea mask was used, more accurate than the operational one, calculated from the USGS Microwave databank II coastlines, lakes and islands at 0.1 degrees resolution, which is the precision of the ships location.

      Positions flagged as being over land by the automatic QC procedure were checked with this new mask:

    14. if 3 or more of the 8 points surrounding the position of the ship were water points, then the ship was automatically flagged as good;
    15. for the other doubtful locations, values were visually checked and a geographical atlas was used in some cases. Flags were updated if necessary and, in some cases, data were corrected (see the data correction section for more details).
    16. Controls on pressure
    17. Foreword: in a SHIP message, the group '4PPPP' is used to report the air pressure at mean sea level. Most of ship messages include it. But another group exists to report the pressure at station level (group 3P0P0P0P0). Its use appeared rare in SHIP messages.

      A first check was made concerning several ships reporting pressure using the 'group 3'. Two different kinds of ships appeared:

    18. ships with only a few 'groups 3' (between 1 and 5 data): for these ships, it appeared that data included in the 'group 3' could not be pressure values. These completly aberrant values were removed.
    19. ships with a lot of 'groups 3' (available in most of the messages from this ship). After controls (pression temporal evolution, comparison with 'pressure at mean sea level' of the same ship, comparison with mean sea level pressure maps), these data appeared consistent. These four ships/platforms are named 45135, 45139 (which are both in fact moored buoys reporting in SHIP message), DBFZ and FNXR.
    20. Following checks were made on the 'pressure at mean sea level'. Afterwards in this section and to lighten the writing, the name of 'pressure' will be used for 'pressure reduced to mean sea level'.

      Each pressure measurement was compared to 'neighbouring' observations (i.e. a 500 km square over a 12 hour interval, centered on the point which is controlled; this area might be extended when a few observations are available, or reduced for a great number of observations). When the difference between the pressure value and the average of the neighbouring observations was greater than a value set to 12hPa, data were checked with the use, in some cases, of the sea level pressure field from the ARPEGE analysis.

      After that, for each ship, the temporal evolution of pressure was compared to limit values depending on the time between the two measurements; these limits have to include the fact that ships move between two measurements, and so pressure can increase or decrease faster than for a ground station.

      temporal interval limit on pressure tendency limit on temperature tendency
      1 hour6 hPa6 degrees C
      3 hours13 hPa8 degrees C
      6 hours20 hPa12 degrees C
      12 hours30 hPa15 degrees C
      24 hours40 hPa18 degrees C

      A third check was also applied, using the pressure tendency. When, for a ship, the time between two pressure observations was equal to 3hours, and when the pressure tendency was available (allowing the computation of the first pressure using the second pressure and the tendency), the difference between the first value (from the message) and the calculated value was computed. If the difference was greater than 3hPa per 6 hours, pressure was visually checked. But tendency often appeared to be doubtful.

    21. Controls on temperature
    22. For each ship, the temporal evolution of the temperature was calculated and compared to the limit values given in the previous table. These limits include the fact that ships move between two measurements, and so temperature can increase or decrease faster than for a ground station.

      Every time the temporal evolution was greater than the appropriate limit, data were visually checked, corrected if an obvious error appeared, and flagged according to the result of the checking.

  19. Data correction
    1. WMO ship's call sign
    2. see section 2.b

    3. Latitude and longitude data
    4. In the following cases, and only in these cases and when the error is obvious, the decision was taken to correct position values:

      error source controls used to show up errors parameters used to correct data
      • latitude/longitude coded in whole hPa instead of tenths of hPa
      • error when coding the quadrant of the globe (used to determine latitude and longitude signs)
      • latitude typed in place of longitude and vice versa in the message
      • inverted figures or use of a neighbouring key on the keyboard
      • Ship trajectory consistency, see section 2.c for more details
      • real time latitude and longitude flags set to 1 (suspect value) or 2 (ship over land, see section 2.d)
      • previous/next latitude/longitude values of the same ship (if not too far)
      • use of the ship heading/speed when available

    5. Pressure data
    6. In the following cases, and only in these cases and when the error is obvious, the decision was taken to correct pressure values:

      error source controls used to show up errors parameters used to correct data
      • pressure coded in whole hPa instead of tenths of hPa
      • reading error on the instrument
      • inverted figures or use of a neighbouring key on the keyboard
      • wrong parameter in the pressure group in the SHIP message (problem of transmission?)
      • gross limit checks and/or
      • comparison with neighbouring observations and/or
      • temporal evolution of pressure for this ship and/or
      • use of the ship pressure tendency and/or
      • comparison with the closer sea surface pressure map (over Atlantic) and/or
      • real time pressure flag set to 1 (suspect value) or 2 (bad value)
      • neighbouring pressures from other ships when available
      • and previous/next pressure values of the same ship (if not too far)
      • and pressure tendency when available

    7. Temperature data
    8. In the following cases, and only in these cases and when the error is obvious, the decision was taken to correct temperature values:

      error source controls used to show up errors parameters used to correct data
      • temperature coded in whole degrees instead of tenths of degrees
      • error when coding the sign
      • reading error on the instrument
      • inverted figures or use of a neighbouring key on the keyboard
      • wrong parameter in the temperature group in the SHIP message (problem of transmission?)
      • gross limit checks
      • temporal evolution of temperature for this ship and/or
      • use of checks on humidity (a wrong humidity may be related to a wrong temperature) and/or
      • real time temperature flag set to 1 (suspect value) or 2 (bad value)
      • neighbouring temperatures from other ships when available
      • and previous/next temperature values of the same ship (if not too far)

      Note that in some cases, it led to flag also humidity values (very low humidity values or obviously too high values due to an error in temperature for example).

    9. Wind data
    10. In the following cases, and only in these cases and when the error is obvious, the decision was taken to correct wind speed values:

      error source controls used to show up errors parameters used to correct data
      • error when coding the unit of wind speed
      • gross limit checks
      • real time wind speed flag set to 2 (bad value)
      • neighbouring wind speeds from other ships when available
      • and previous/next wind speed values of the same ship (if not too far)

    11. Flagging/correcting data
    12. When a correction is made, the corresponding flag is set to 8 (corrected value).

      When the correction is not obvious, but the value obviously suspect or bad, flags were used to improve the data qualification (set to 6 or 7). Conversely, some values flagged to suspect or bad by the real time control were obviously good data and the flag was set to 5 (case of a value set to suspect because of an error when decoding the message, or of a ship position over land, position which could be corrected).

  20. Summary
  21. The following table sums up the percentage (number between brackets) of suspect, bad and corrected data using all these checkings:

    Parameter data flagged as correct data flagged as suspect or bad data flagged as corrected
    location 2.2% (3886) 0.3% (471) 0.16% (285)
    pressure 0.003% (5)0.35% (604) 0.29% (493)
    temperature0.05% (82) 0.04% (72) 0.04% (72)
    wind speed 0.01% (23) 0.09% (148)0.02% (37)

Updated: 15 Jan 1999

Top of page