Ambiguous Date Parsing

ExtractionDifficulty: HARD11 models comparedTotal cost: $0.010487

Prompt

Extract all dates mentioned and convert to ISO 8601 format. Reply as JSON array.

"The contract was signed on 3/4/24 and takes effect next Tuesday. The first milestone is due in 6 weeks, with final delivery by end of Q2. Note: the original deadline of Feb 29th was moved due to the leap year issue."

Model Responses

Claude Opus 4.6

Unrated
5,647ms357 tokens
```json [ { "original": "3/4/24", "iso_8601": "2024-03-04", "note": "Assumed MM/DD/YY format (US convention)" }, { "original": "next Tuesday", "iso_8601": null, "note": "Relative date; cannot determine without a reference date" }, { "original": "in 6 weeks", "iso_8601": null, "note": "Relative date; depends on the contract signing date (approx. 2024-04-15 if from 3/4/24)" }, { "original": "end of Q2", "iso_8601": "2024-06-30", "note": "End of Q2 2024" }, { "original": "Feb 29th", "iso_8601": "2024-02-29", "note": "2024 is a leap year, so Feb 29 exists" } ] ```

GPT-4.1

Unrated
898ms132 tokens
```json [ "2024-03-04", "2024-03-12", "2024-04-23", "2024-06-30", "2024-02-29" ] ```

GPT-4.1 Mini

Unrated
989ms104 tokens
```json [ "2024-03-04", "2024-02-29" ] ```

GPT-4.1 Nano

Unrated
435ms100 tokens
[ "2024-03-04", "2024-02-29" ]

o3

Unrated
8,128ms1040 tokens
[ "2024-03-04", "2024-03-05", "2024-04-16", "2024-06-30", "2024-02-29" ]

o3 Mini

Unrated
9,086ms1260 tokens
["2024-03-04", "2024-02-29"]

o4 Mini

Unrated
11,003ms1360 tokens
[ "2024-03-04", "2024-03-05", "2024-04-15", "2024-06-30", "2024-02-29" ]

GPT-4o Mini

Unrated
1,756ms113 tokens$0.000032
```json [ "2024-03-04", "2024-02-29", "2024-06-30" ] ```

Claude Haiku 4.5

Unrated
1,564ms214 tokens$0.000730
```json [ { "original": "3/4/24", "iso8601": "2024-03-04" }, { "original": "Feb 29th", "iso8601": "2024-02-29" } ] ``` **Note:** "next Tuesday," "6 weeks," and "end of Q2" are relative dates without specific calendar dates, so they cannot be converted to ISO 8601 format without additional context (e.g., knowing the reference date).

GPT-4o

Unrated
3,634ms332 tokens$0.002720
To convert the dates mentioned in the given text to ISO 8601 format, we first need to interpret each date: 1. "3/4/24" - This can be ambiguous as either March 4, 2024, or April 3, 2024. However, the context helps determine its meaning. In ISO format, assuming MM/DD/YY, it becomes "2024-03-04". 2. "next Tuesday" - Without an explicit reference date, we cannot determine the exact date. 3. "in 6 weeks" - Similar to "next Tuesday", without a reference date, this cannot be determined. 4. "end of Q2" - End of Q2 typically means the end of June, so it would be "2024-06-30". 5. "Feb 29th" - As leap years occur every four years, the next occurrence after 2024 will be "2028-02-29". Assuming today's date is not needed for calculations as it isn't provided in the text, the dates that can be extracted and converted are: ```json ["2024-03-04", "2024-06-30", "2028-02-29"] ```

Claude Sonnet 4.6

Unrated
8,469ms535 tokens$0.007005
Here is my analysis of the dates mentioned, with notes on ambiguity: ```json [ { "original": "3/4/24", "iso8601": "2024-03-04", "notes": "Ambiguous: could be March 4 (MM/DD/YY) or April 3 (DD/MM/YY). Defaulting to MM/DD/YY (US format) → 2024-03-04" }, { "original": "next Tuesday", "iso8601": null, "notes": "Relative date — cannot convert without knowing the reference date of the document" }, { "original": "in 6 weeks", "iso8601": null, "notes": "Relative date — cannot convert without knowing the reference/start date" }, { "original": "end of Q2", "iso8601": "2024-06-30", "notes": "Assuming calendar year 2024 based on context; Q2 ends June 30" }, { "original": "Feb 29th", "iso8601": null, "notes": "Invalid date — February 29 did not exist in 2024 (2024 IS a leap year, so this is valid: 2024-02-29). The text says it was moved 'due to the leap year issue', implying the prior cycle (e.g. 2023) was not a leap year. Possible intended date: 2024-02-29" } ] ``` > ⚠️ **Key caveats:** Three of the five dates are relative and require a known anchor date to resolve. The `3/4/24` format is regionally ambiguous. The `Feb 29th` reference contains an internal contradiction in the source text.

Cost & Performance Comparison

ModelLatencyTokensCostVerdict
Claude Opus 4.65,647ms357
GPT-4.1898ms132
GPT-4.1 Mini989ms104
GPT-4.1 Nano435ms100
o38,128ms1,040
o3 Mini9,086ms1,260
o4 Mini11,003ms1,360
GPT-4o Mini1,756ms113$0.000032
Claude Haiku 4.51,564ms214$0.000730
GPT-4o3,634ms332$0.002720
Claude Sonnet 4.68,469ms535$0.007005