Sample web server logs CSV.
An access log flattened to CSV: timestamp, client IP, method, path, status code, response time and user agent. For parsing practice, status-code charts and latency analysis.
The small files are static downloads. The large ones are generated in your browser from the same fixed seed, so every copy of web-logs-100000.csv on earth is byte-identical — reproducible test data with no 60 MB download.
| timestamp | ip | method | path | status | response_ms | user_agent |
|---|---|---|---|---|---|---|
| 2025-03-01T00:00:00Z | 38.197.234.35 | GET | /pricing | 200 | 28 | curl/8.5.0 |
| 2025-03-01T00:00:02Z | 193.31.206.152 | GET | /search | 200 | 45 | Mozilla/5.0 (Macintosh; Intel Mac OS X 14_5) Safari/605.1.15 |
| 2025-03-01T00:00:04Z | 45.134.206.17 | POST | /pricing | 200 | 12 | Mozilla/5.0 (Macintosh; Intel Mac OS X 14_5) Safari/605.1.15 |
| 2025-03-01T00:00:06Z | 95.169.100.203 | GET | /docs | 200 | 140 | curl/8.5.0 |
| 2025-03-01T00:00:08Z | 222.3.40.196 | GET | /assets/app.js | 200 | 45 | Mozilla/5.0 (iPhone; CPU iPhone OS 17_5 like Mac OS X) Mobile/15E148 |
| 2025-03-01T00:00:10Z | 139.93.85.71 | GET | /api/v1/items | 200 | 140 | Mozilla/5.0 (Windows NT 10.0; Win64; x64) Chrome/126.0 |
| 2025-03-01T00:00:12Z | 196.160.43.63 | GET | /docs | 200 | 12 | Mozilla/5.0 (iPhone; CPU iPhone OS 17_5 like Mac OS X) Mobile/15E148 |
| 2025-03-01T00:00:14Z | 196.226.99.183 | GET | /account | 200 | 28 | Mozilla/5.0 (iPhone; CPU iPhone OS 17_5 like Mac OS X) Mobile/15E148 |
| column | description |
|---|---|
| timestamp | Request time (UTC) |
| ip | Client IP (random, synthetic) |
| method | HTTP method |
| path | Request path |
| status | HTTP status code |
| response_ms | Response time (ms, right-skewed) |
| user_agent | User agent |
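As a starting point for parsing practice, here is a minimal sketch that reads the columns above into typed Python values using only the standard library. It assumes the file starts with a header row using these exact column names and that `response_ms` is an integer, as in the sample rows; `SAMPLE_CSV` and `parse_rows` are illustrative names, not part of the dataset.

```python
import csv
import io
from datetime import datetime, timezone

# Three rows copied from the sample table, laid out as they would appear in the CSV.
# Assumption: the real file has a header row with these exact column names.
SAMPLE_CSV = """timestamp,ip,method,path,status,response_ms,user_agent
2025-03-01T00:00:00Z,38.197.234.35,GET,/pricing,200,28,curl/8.5.0
2025-03-01T00:00:02Z,193.31.206.152,GET,/search,200,45,Mozilla/5.0 (Macintosh; Intel Mac OS X 14_5) Safari/605.1.15
2025-03-01T00:00:04Z,45.134.206.17,POST,/pricing,200,12,Mozilla/5.0 (Macintosh; Intel Mac OS X 14_5) Safari/605.1.15
"""


def parse_rows(text_stream):
    """Yield one dict per request, converting timestamp, status and latency."""
    for row in csv.DictReader(text_stream):
        yield {
            # The trailing "Z" marks UTC, so attach an explicit UTC timezone.
            "timestamp": datetime.strptime(
                row["timestamp"], "%Y-%m-%dT%H:%M:%SZ"
            ).replace(tzinfo=timezone.utc),
            "ip": row["ip"],
            "method": row["method"],
            "path": row["path"],
            "status": int(row["status"]),
            "response_ms": int(row["response_ms"]),
            "user_agent": row["user_agent"],
        }


rows = list(parse_rows(io.StringIO(SAMPLE_CSV)))
print(rows[0]["timestamp"].isoformat(), rows[0]["status"], rows[0]["response_ms"])
```

To read a downloaded file instead of the inline sample, pass `open("web-logs-100000.csv", newline="", encoding="utf-8")` to `parse_rows`; because it is a generator, it streams rows rather than loading the whole file.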
Status codes follow a production-like distribution (mostly 200s, some redirects, a long tail of 404s and a sliver of 500s). Response times are right-skewed like real latency.
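One quick way to see both properties is to bucket status codes by class and compare the mean latency with the median. The sketch below uses the eight `(status, response_ms)` pairs from the sample rows, which happen to be all 200s, so the full files will show more classes. `summarize` is an illustrative helper, not something the dataset ships with.

```python
from collections import Counter
from statistics import mean, median

# (status, response_ms) pairs taken from the eight sample rows.
SAMPLE = [
    (200, 28), (200, 45), (200, 12), (200, 140),
    (200, 45), (200, 140), (200, 12), (200, 28),
]


def summarize(pairs):
    """Count requests per status class (2xx, 3xx, ...) and summarize latency."""
    classes = Counter(f"{status // 100}xx" for status, _ in pairs)
    latencies = [ms for _, ms in pairs]
    return {
        "status_classes": dict(classes),
        "mean_ms": mean(latencies),
        "median_ms": median(latencies),
    }


summary = summarize(SAMPLE)
print(summary)
# A mean above the median is the usual signature of a right-skewed distribution.
print("right-skewed:", summary["mean_ms"] > summary["median_ms"])
```

On these eight rows the mean is 56.25 ms against a median of 36.5 ms; for real latency work, percentiles such as p95 or p99 are usually more informative than either.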
IPs are random across the full IPv4 space and user agents come from a small modern pool — synthetic, so no real visitor data is involved.
Good for: Log parsing and regex practice · Status-code and latency dashboards · Big-file pipeline testing at 1M rows.
License: CC0 / public domain — use it anywhere, no attribution needed.
CSV drops straight into spreadsheets, dashboards and SQL. If you need raw access-log format for a parser, this still gives you realistic field values to assemble from.
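If a parser expects something close to the Apache/NGINX combined log format, a row can be reassembled roughly as follows. This is a sketch under stated assumptions: the CSV has no protocol, response size or referer, so the protocol is a hypothetical default (`HTTP/1.1`) and the missing fields are written as `-`. `to_combined_log` is an illustrative name.

```python
from datetime import datetime

# Fixed English month abbreviations, so output does not depend on the locale.
_MONTHS = ("Jan", "Feb", "Mar", "Apr", "May", "Jun",
           "Jul", "Aug", "Sep", "Oct", "Nov", "Dec")


def to_combined_log(row, protocol="HTTP/1.1"):
    """Build a combined-log-style line from one CSV row (a dict of strings).

    Assumptions: `protocol` is not in the CSV, and the response size and
    referer are unknown, so both are emitted as "-".
    """
    ts = datetime.strptime(row["timestamp"], "%Y-%m-%dT%H:%M:%SZ")
    stamp = f"{ts.day:02d}/{_MONTHS[ts.month - 1]}/{ts.year}:{ts:%H:%M:%S} +0000"
    return (
        f'{row["ip"]} - - [{stamp}] '
        f'"{row["method"]} {row["path"]} {protocol}" '
        f'{row["status"]} - "-" "{row["user_agent"]}"'
    )


first_row = {
    "timestamp": "2025-03-01T00:00:00Z",
    "ip": "38.197.234.35",
    "method": "GET",
    "path": "/pricing",
    "status": "200",
    "response_ms": "28",
    "user_agent": "curl/8.5.0",
}
line = to_combined_log(first_row)
print(line)
```

Note that `response_ms` has no slot in the standard combined format; append it as an extra trailing field if your parser supports custom formats.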
CC0 (public domain). Use it in tutorials, tests, courses, screenshots and products — no attribution required.
Every size is generated from a fixed seed, so the same file is byte-identical for everyone, every time. That gives you reproducible tests and stable teaching materials.
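To check the byte-identical claim yourself, hash the file and compare digests across machines or runs. The sketch below streams the file in chunks so it also works for the largest sizes. No reference checksum is given here, so the comparison is between your own copies; `sha256_of` is an illustrative helper.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel keeps calling read() until it returns b"".
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

For example, running `sha256_of("web-logs-100000.csv")` on two separately generated copies should print the same digest if the files really are identical.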