筱雅

NYC Taxi Jan-Aug 2022

exploratory data analysis, clustering, tabular, regression, travel

2

Sold: 0
87.06MB

Data ID: D17222465497591716

Published: 2024/07/29

The following is the data verification report that the seller chose to provide:

Data description

Description

TLC Trip Record Data: Yellow and green taxi trip records include fields capturing pick-up and drop-off dates/times, pick-up and drop-off locations, trip distances, itemized fares, rate types, payment types, and driver-reported passenger counts. The data used in the attached datasets were collected and provided to the NYC Taxi and Limousine Commission (TLC) by technology providers authorized under the Taxicab & Livery Passenger Enhancement Programs (TPEP/LPEP). The trip data was not created by the TLC, and the TLC makes no representations as to the accuracy of these data.

For-Hire Vehicle (“FHV”) trip records include fields capturing the dispatching base license number and the pick-up date, time, and taxi zone location ID (shape file below). These records are generated from the FHV Trip Record submissions made by bases. Note: The TLC publishes base trip record data as submitted by the bases, and we cannot guarantee or confirm their accuracy or completeness. Therefore, this may not represent the total amount of trips dispatched by all TLC-licensed bases. The TLC performs routine reviews of the records and takes enforcement actions when necessary to ensure, to the extent possible, complete and accurate information.

Proposed tasks

  • Identify the routes with the lowest price/km, time/km, and price/time ratios.
  • Visualize the evolution of the average travel time and distance throughout the day.
  • Calculate the probability of traveling from one zone to another in less than X minutes (where X is easily modifiable).
  • Analyze the zones where passengers are most likely to take a taxi, by time of day.
  • Determine the best time of day to travel to the airport.
  • Design a model that predicts the travel duration and cost given the hour, origin, and destination zone. Show the relevance of the dataset attributes.
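As one illustration of the tasks above, the probability of traveling between two zones in under X minutes can be estimated as the share of observed trips on that route that are shorter than X. The sketch below is a minimal, standard-library-only example. The column names come from the Features list; the sample rows, the timestamp format, and the helper names `trip_minutes` and `prob_under` are assumptions made for illustration and are not part of the dataset.

```python
from datetime import datetime

# Assumed timestamp format; adjust if the actual files differ.
TIME_FORMAT = "%Y-%m-%d %H:%M:%S"

# Hypothetical sample rows (not real data), using the dataset's column names.
SAMPLE_TRIPS = [
    {"PULocationID": 74, "DOLocationID": 75,
     "lpep_pickup_datetime": "2022-01-01 08:00:00",
     "lpep_dropoff_datetime": "2022-01-01 08:07:00"},   # 7 minutes
    {"PULocationID": 74, "DOLocationID": 75,
     "lpep_pickup_datetime": "2022-01-01 09:00:00",
     "lpep_dropoff_datetime": "2022-01-01 09:12:00"},   # 12 minutes
    {"PULocationID": 74, "DOLocationID": 75,
     "lpep_pickup_datetime": "2022-01-02 18:30:00",
     "lpep_dropoff_datetime": "2022-01-02 18:39:00"},   # 9 minutes
    {"PULocationID": 74, "DOLocationID": 75,
     "lpep_pickup_datetime": "2022-01-03 17:00:00",
     "lpep_dropoff_datetime": "2022-01-03 17:15:00"},   # 15 minutes
    {"PULocationID": 74, "DOLocationID": 42,
     "lpep_pickup_datetime": "2022-01-03 10:00:00",
     "lpep_dropoff_datetime": "2022-01-03 10:05:00"},   # 5 minutes
]


def trip_minutes(trip):
    """Return the trip duration in minutes from pickup and dropoff times."""
    start = datetime.strptime(trip["lpep_pickup_datetime"], TIME_FORMAT)
    end = datetime.strptime(trip["lpep_dropoff_datetime"], TIME_FORMAT)
    return (end - start).total_seconds() / 60.0


def prob_under(trips, origin, destination, max_minutes):
    """Empirical share of origin -> destination trips shorter than max_minutes.

    Returns None when no trips were observed on the route.
    """
    durations = [
        trip_minutes(t) for t in trips
        if t["PULocationID"] == origin and t["DOLocationID"] == destination
    ]
    if not durations:
        return None
    return sum(d < max_minutes for d in durations) / len(durations)


# X is just a parameter, so it is easy to modify.
print(prob_under(SAMPLE_TRIPS, origin=74, destination=75, max_minutes=10))
```

On real data, routes with very few observed trips would give unstable estimates, so a minimum trip count per route may be worth adding.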

Features

  • VendorID: An ID code indicating the taxi vendor: 1 = Creative Mobile Technologies, LLC; 2 = VeriFone Inc.
  • lpep_pickup_datetime: The date and time when the taxi ride started.
  • lpep_dropoff_datetime: The date and time when the taxi ride ended.
  • store_and_fwd_flag: Indicates whether the trip record was held in vehicle memory before being sent to the vendor: Y = store and forward; N = not a store and forward trip.
  • RatecodeID: The rate code for the trip: 1 = Standard rate, 2 = JFK, 3 = Newark, 4 = Nassau or Westchester, 5 = Negotiated fare, 6 = Group ride.
  • PULocationID: The pickup location ID, corresponding to the taxi zone where the taximeter was engaged.
  • DOLocationID: The dropoff location ID, corresponding to the taxi zone where the taximeter was disengaged.
  • passenger_count: The number of passengers in the vehicle.
  • trip_distance: The distance of the trip in miles.
  • fare_amount: The metered fare for the trip.
  • extra: Extra charges. Currently, this only includes the 0.50 dollar and 1 dollar rush hour and overnight charges.
  • mta_tax: The 0.50 dollar MTA tax that is automatically triggered based on the metered rate in use.
  • tip_amount: The tip amount. This field is automatically populated for credit card tips. Cash tips are not included.
  • tolls_amount: The total amount of all tolls paid during the trip.
  • ehail_fee: A 1.00 dollar surcharge that is automatically applied to every trip booked through the ehail platform.
  • improvement_surcharge: A 0.30 dollar improvement surcharge assessed on trips at the flag drop. The improvement surcharge began being levied in 2015.
  • total_amount: The total amount charged to passengers. This field includes the metered fare, extra charges, mta_tax, tip_amount, and tolls_amount, plus any improvement_surcharge or ehail_fee.
  • payment_type: A numeric code indicating the payment method: 1 = Credit card, 2 = Cash, 3 = No charge, 4 = Dispute, 5 = Unknown, 6 = Voided trip.
  • trip_type: A code indicating whether the trip was a street-hail or a dispatch. It is automatically assigned based on the metered rate in use but can be overridden by the driver.
  • congestion_surcharge: A 2.75 dollar congestion surcharge assessed on trips in yellow and green taxis in Manhattan south of 96th St. The surcharge began being levied in 2019.
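The description of total_amount suggests a simple consistency check: sum the listed components and compare the result with the recorded total. The following is a minimal sketch under that reading. The sample row is hypothetical, the helper names `expected_total` and `total_matches` are illustrative, and missing values are assumed to count as zero. Note that congestion_surcharge is not listed among the components of total_amount in the description, so it is left out here; whether it is included in practice would need to be checked against the data.

```python
import math

# Components of total_amount as listed in the field description.
# congestion_surcharge is not in that list and is deliberately excluded here.
COMPONENTS = (
    "fare_amount",
    "extra",
    "mta_tax",
    "tip_amount",
    "tolls_amount",
    "improvement_surcharge",
    "ehail_fee",
)

# Hypothetical sample row (not real data).
SAMPLE_ROW = {
    "fare_amount": 10.0,
    "extra": 0.5,
    "mta_tax": 0.5,
    "tip_amount": 2.0,
    "tolls_amount": 0.0,
    "improvement_surcharge": 0.3,
    "ehail_fee": None,       # assumed: missing values count as 0
    "total_amount": 13.3,
}


def expected_total(row):
    """Sum the listed components, treating missing or None values as 0."""
    return round(sum(row.get(name) or 0.0 for name in COMPONENTS), 2)


def total_matches(row, tolerance=0.01):
    """Check whether the recorded total agrees with the component sum."""
    return math.isclose(expected_total(row), row["total_amount"], abs_tol=tolerance)


print(expected_total(SAMPLE_ROW), total_matches(SAMPLE_ROW))
```

Rows that fail the check are not necessarily errors; they may simply include a charge that the description does not list.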

License

The data from the New York City Taxi and Limousine Commission (TLC) Trip Record Data website is available to the public under the Open Data Commons Open Database License (ODbL). This license allows for the use, sharing, and modification of the data as long as attribution is given to the original source and any derivative works are also licensed under the ODbL.

Citation

New York City Taxi and Limousine Commission (2019). TLC Trip Record Data. Retrieved from https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page
