An analysis of ridesharing trip time using advanced text mining techniques

Wenxiang Xu; Anae Sobhani; Ting Fu; Amir Mahdi Khabooshani; Aminreza Vazirinasab; Sina Shokoohyar; Ahmad Sobhani; Behnaz Raouf; Wenxiang Xu; Anae Sobhani; Ting Fu; Amir Mahdi Khabooshani; Aminreza Vazirinasab; Sina Shokoohyar; Ahmad Sobhani; Behnaz Raouf

doi:10.48130/DTS-2023-0026

The time cost of ridesharing rental represents a crucial factor influencing users' decisions to rent a car. Researchers have explored this aspect through text analysis and questionnaires. However, the current research faces limitations in terms of data quantity and analysis methods, preventing the extraction of key information. Therefore, there is a need to further optimize the level of public opinion analysis. This study aimed to investigate user perspectives concerning travel time in ridesharing, both pre and post-pandemic, within the Twitter application. Our analysis focused on a dataset from users residing in the USA and India, with considerations for demographic variables such as age and gender. To accomplish our research objectives, we employed Latent Dirichlet Allocation for topic modeling and BERT for sentiment analysis. Our findings revealed significant influences of the pandemic and the user's country of origin on sentiment. Notably, there was a discernible increase in positive sentiment among users from both countries following the pandemic, particularly among older individuals. These findings bear relevance to the ridesharing industry, offering insights that can aid in establishing benchmarks for improving travel time. Such improvements are instrumental in enabling ridesharing companies to effectively compete with other public transportation alternatives.

Data type

Description

Users' characteristics

Gender, age, user name, user ID, followers.

Timestamp

The timestamp of each tweet publishes.

Location

The county and location of the user.

The content of the tweet, the situation of the tweet (rewrite or not).

Sample of the tweet before and after the filter

Before filter: @Uber### I like

and miss # uberpull much, prices are odeeeeeeeeer cheaper #uber. https://t.co/OOLOYLexyC
After filter: I like and miss uberpool, these prices are cheaper.

Item

Label

Content

Description

Ridesharing trip time

Wait time

Wait time for the car

Time cost

The time cost from entering the car to ending the trip

Trip happen time

Trip time of day

Pandemic

Topic related to pandemic

Step

Items

Stad. E.

Wald

Freedom

Sig.

Exp(B)

95% CI

Intercept

−0.48

0.15

10.19

1.00

0.00

Pandemic

0.22

0.11

3.67

1.00

0.03

1.01

0.89

1.13

Gender

0.04

0.06

0.55

1.00

0.89

1.04

0.93

1.18

Age

−0.77

0.09

0.75

1.00

0.79

0.92

0.77

1.10

Country

0.33

0.12

4.45

1.00

0.02

1.12

1.01

1.23

Intercept

−0.49

0.16

9.65

1.00

0.02

−

Pandemic

0.12

0.13

4.66

1.00

0.03

1.05

1.01

1.15

Gender

0.01

0.12

0.01

1.00

0.97

1.00

0.79

1.27

Age

0.09

0.67

2.12

1.00

0.12

1.11

0.96

1.26

Country

−0.18

0.09

3.48

1.00

0.02

1.01

0.99

1.22

HTML

Introduction

Research on ridesharing trip time significantly impacts users' decisions in choosing between rideshare services and rental cars. Insights into user preferences regarding trip duration guide ridesharing companies in refining service efficiency. This research informs competitive strategies, highlighting factors that drive users to opt for immediate and reliable rental car alternatives. Additionally, it contributes to urban mobility planning and discussions on sustainable transportation solutions, shaping the evolving landscape of modern transportation^[1]. Contemporary research on ridesharing trip time commonly employs data analysis and survey methodologies. For instance, Zhang et al.^[2] utilized a data analysis approach, determining that ridesharing trip times are notably influenced by traffic conditions and demand fluctuations. However, limitations include potential biases in questionnaire responses and challenges in obtaining comprehensive longitudinal data. Existing methods also face difficulty in thoroughly exploring users' authentic attitudes toward car usage, particularly concerning public opinion analysis, resulting in an incomplete research framework.

This study delves into the impact of ride-sharing systems on various aspects of travel, including travel time, waiting time, and daily travel patterns, with a specific focus on distinct age and gender groups. The research context is particularly pertinent considering the escalating challenges posed by heightened urbanization and technological advancements, which have led to increased traffic congestion, prolonged travel times, and environmental concerns. In response to these challenges, researchers have proposed the development of an integrated and adaptable transportation system that harmonizes public transportation with digital technology. The emergence of ride-sharing platforms, exemplified by Uber in the USA, underscores the global demand for technologically driven travel solutions. While Lyft also operates in the US market, it is crucial to note that Uber currently commands a significant share of the market^[3]. However, it is imperative to acknowledge that the adoption of ride-sharing systems, particularly when supplanting traditional taxis and public transit, can potentially exacerbate issues such as heightened traffic congestion, increased Vehicle Kilometers Traveled (VKT), and a potential rise in traffic accidents^[4].

Over time, it has become evident that the implementation of the ride-sharing system has not led to a substantial reduction in traffic congestion and has posed accessibility challenges for individuals with lower incomes^[5]. Consequently, experts have turned their attention to alternative systems that can more effectively address issues related to traffic reduction, VKT reduction, and affordability for a broader spectrum of users. One such alternative proposed by experts is a platform-based sharing service that connects users with service providers, enabling the temporary sharing of goods and services. Ridesharing, in this context, is defined as the practice of multiple individuals sharing a car trip to reach a common destination^[6]. Passengers have the flexibility to use the service for either the entirety of their journey or only a portion, as defined by Transport for London^[7]. Various mobile applications are employed to manage and schedule ride-sharing services. Research has shown that ride-sharing services like Uber possess the potential to reduce individual car usage, shift the mode of transportation from single occupancy vehicles to shared rides, and promote off-peak travel, thus mitigating overall traffic congestion in urban areas^[8,9].

A study conducted during the COVID-19 pandemic revealed a notable decrease in ride-sharing traffic, exceeding the overall reduction in total traffic volume. Furthermore, non-shared trips during the pandemic exhibited increased travel distances, even though travel durations did not necessarily extend due to decreased network congestion^[10]. However, it is crucial to acknowledge that the ride-sharing industry faced significant disruptions due to pandemic-induced lockdowns imposed in numerous large cities^[11]. Research conducted by some scholars has indicated that the number of Uber users and trips had been steadily increasing until 2019. However, the growth rate experienced a sharp decline following the onset of the COVID-19 pandemic. A similar decrease in the growth rate was also observed for the Lyft platform during the pandemic. Distinguishing between ridesharing trip time scales (before, during, and after the pandemic) when assessing car rentals is imperative for several reasons. Firstly, the pandemic has profoundly altered travel patterns, impacting the demand and availability of rideshare services. Analyzing distinct time scales allows researchers to discern how the pandemic influenced user preferences, the frequency of rideshare usage, and its subsequent effects on the car rental industry. Secondly, variations in ridesharing trip times across different phases provide insights into evolving consumer behaviors. Understanding how preferences shift during and after the pandemic empowers car rental companies to tailor their services to changing demands, ensuring relevance and competitiveness. Additionally, discerning between these time scales aids in evaluating the resilience and adaptability of the car rental industry to external shocks. This nuanced analysis assists policymakers, industry stakeholders, and researchers in developing strategies that account for the dynamic nature of consumer behavior, contributing to a more comprehensive and forward-looking understanding of the market. In essence, differentiating between ridesharing trip time scales allows for a more nuanced understanding of the complex interplay between the pandemic, ridesharing, and car rentals, guiding informed decision-making and strategic planning in the evolving landscape of urban mobility.

Social media platforms, particularly Twitter, serve as abundant sources of real-time, user-generated content, reflecting personal experiences, opinions, and emotions. Through the analysis of Twitter data, researchers can extract spontaneous and unfiltered expressions, yielding valuable insights into user perceptions of ride-sharing travel times^[12]. Twitter's broad user base transforms it into a virtual forum where individuals openly share experiences, challenges, and opinions related to ridesharing. Extracting user perspectives from Twitter data facilitates the capture of diverse opinions and sentiments, providing a comprehensive understanding of factors influencing user choices and satisfaction in the ride-sharing realm^[13]. The textual nature of Twitter posts aligns seamlessly with text analysis methods, enabling researchers to uncover nuanced insights into travel times, service quality, pricing, and other aspects shaping users' perceptions of ridesharing^[14]. Employing advanced techniques like graph-based text extraction allows researchers to overcome challenges such as data loss, enhancing the extraction of key features and boosting the reliability of research results. Consequently, this study aims to investigate ridesharing trip times based on Twitter data, extracting relevant text features and public opinions for a thorough analysis.

Research objective
To delve deeper into the influence of ridesharing trip time on car rentals, this study refined methodologies in both text analysis and public opinion analysis. In text analysis, a novel graph-based text extraction method for Twitter big data is introduced to address issues like missing data and inadequate extraction of key features encountered in the current analytical process. In public opinion analysis, the existing model undergoes optimization through deep learning algorithms, rectifying inaccuracies in public opinion assessments. The specifics are outlined below: Considering the context outlined above, this study seeks to investigate the perspectives of a substantial user community within the Twitter application concerning ridesharing. The research focuses on users located in the USA and India, representing a developed and developing country, respectively, marked by significant cultural distinctions that can provide valuable insights for the effective management of ride-sharing systems, particularly during crises such as pandemics. To achieve this, a dataset comprising 63,800 Twitter posts was meticulously collected utilizing Text Mining techniques, with the inclusion of demographic details such as age, gender, and country of origin, both before and during the pandemic periods. The principal objective is to conduct a comparative analysis between the two countries based on the compiled dataset. The data analysis will be conducted employing the Bidirectional Encoder Representations from Transformers model (BERT), the Valence Aware Dictionary and sEntiment Reasoner (VADER) for sentiment analysis, and Latent Dirichlet Allocation (LDA) for topic modeling.

Literature review

This article primarily focuses on the examination of travel time within ride-sharing systems, such as Uber, and how the COVID-19 pandemic has impacted this aspect. Additionally, it explores user perspectives within broader platforms like Twitter, even though the existing literature on this topic is limited. The significance of this issue is underscored by an analysis of pre- and post-COVID-19 literature. Ridesharing holds the potential to mitigate traffic congestion, reduce Vehicle Miles Traveled (VMT), enhance air quality, stimulate economic growth, and optimize travel time for all road users. Notably, travel time within ridesharing is perceived as more valuable compared to train or bus travel in France^[15]. Consequently, individuals with higher incomes tend to prioritize their travel time, rendering ride-sharing an appealing option over other modes of transportation^[16]. The value of users' time and the uncertainty of travel time have been identified as pivotal factors significantly influencing the performance of ride-sharing systems^[17]. Ensuring immediate and optimal compliance is one avenue for achieving success in ride-sharing services. Ride-sharing service providers must furnish users with information regarding arrival time, travel duration, and financial benefits, as a failure to do so can result in trip cancellations. Dissatisfaction with travel time duration has also been observed among ride-sharing drivers. Research conducted in China indicates that tourists assign a higher value to travel time and are willing to pay more to reduce travel duration, particularly older residents who exhibit a propensity to pay extra to minimize waiting times^[18]. Another study in China estimated that ride-sharing users saved more than 1.7 billion hours of travel time between 2016 and 2018^[19]. In Mumbai, India, travel comfort is prioritized over security and waiting time^[20]. A survey conducted in the USA, involving 4,365 participants, revealed that a one-minute reduction in relative travel time per mile resulted in a 33% increase in ride-sharing usage^[21]. Notably, the COVID-19 pandemic induced short-term alterations in departure time, transportation mode, destination, and route choices. In Canada, a substantial portion of travel plans was either canceled or rescheduled due to extended travel durations. In China, following the pandemic, there was an improvement in the number of trips, albeit covering shorter distances. Conversely, Greece witnessed an increase in travel time during the COVID-19 pandemic. In summary, the pandemic has prompted transient shifts in travel behavior across various regions, influencing choices related to departure time, transportation mode, destination, and route.

By analyzing the attitudes towards ridesharing in different countries, we can more fully understand the impact of ridesharing trip time on ridesharing. Analyzing ridesharing in both India and the USA offers a compelling comparative perspective due to the distinct socio-economic, cultural, and infrastructural differences between the two countries. The motivation lies in understanding how these contextual disparities influence the dynamics of ridesharing services and user behaviors. India, characterized by a diverse population and varied urban landscapes, presents unique challenges and opportunities for ridesharing. Factors such as dense traffic, diverse commuting patterns, and varying economic conditions significantly impact ridesharing utilization. On the other hand, the USA, with its diverse cities and extensive transportation infrastructure, showcases a different ridesharing landscape shaped by cultural, economic, and regulatory factors. Comparing ridesharing in these countries enables researchers to draw nuanced conclusions about the adaptability of ridesharing models in diverse environments. For instance, insights into user preferences, pricing sensitivities, and the impact of regulatory frameworks can be gleaned by examining the differences in adoption patterns and service utilization. Research comparing ridesharing in different countries includes studies like 'An Analysis of Ridesharing in India: The Case of Uber and Ola'^[22] and 'The Competitive Effects of the Sharing Economy: How is Uber Changing Taxis?'^[23]. Understanding these variations aids ridesharing service providers, policymakers, and urban planners in tailoring strategies to specific market conditions. Additionally, it contributes to a broader understanding of the global ridesharing landscape, fostering insights that can be beneficial for the sustainable development of urban transportation systems.

Research concerning the utilization of Twitter data for the examination of user perspectives on ridesharing has evolved into an active and dynamic field. Within this realm, researchers and data scientists have harnessed Twitter data to extract insights into public sentiments, opinions, and attitudes vis-à-vis ridesharing services, including but not limited to Uber and Lyft. The primary advancements in this domain encompass two distinct methodologies. Firstly, topic modeling techniques such as Latent Dirichlet Allocation (LDA) have been instrumental in the identification of key topics and discussions related to ridesharing within the Twitter sphere. This facilitates a nuanced comprehension of the predominant themes and concerns within user conversations^[24]. LDA is a widely used and robust topic modeling technique that assumes a relationship between words and documents within a corpus represented as a bag-of-words. LDA has been applied in various domains, including healthcare^[25], e-petitions^[26], politics^[24], and evaluating social media strategies with both long-length (e.g., abstracts) and short-length (e.g., tweets) datasets^[27]. For example, Pournarakis et al.^[28] employed LDA for topic modeling in transportation services, developing and implementing a Genetic Algorithm based on LDA to categorize tweets into different topics, outperforming the K-means clustering approach. Another relevant study utilized Twitter data to explore ride-sharing services, indicating that LDA topic modeling efficiently extracts prevalent topics from large datasets^[29].

Secondly, sentiment analysis, a commonly employed technique, serves to gauge public sentiment towards ridesharing services on Twitter. This method leverages natural language processing (NLP) and machine learning algorithms to classify tweets into positive, negative, or neutral sentiment categories, thereby shedding light on users' emotional inclinations towards ridesharing platforms^[18]. BERT has gained popularity. It aims to enhance computers' comprehension of sentiment in complex language by establishing context through surrounding text. Researchers proposed an auxiliary sentence transformation (T)ABSA, which converts the single sentence classification problem into a sentence pair classification task using BERT^[15,25]. The results demonstrated that BERT-pair outperformed other models in aspect detection and sentiment analysis, using the SentiHood dataset. Traditional language models could only read text sequentially and lacked bidirectional capability^[30]. BERT introduced the concept of Transformers, which enabled bidirectional reading. The BERT framework was pre-trained on Wikipedia text and can be fine-tuned using a dataset of questions and answers, overcoming limitations related to data volume and dataset transfer in supervised methods^[9]. In this study, BERT was employed as a reference model, and logistic regression was used for sentiment retrieval and subsequent correlation analysis.

Nonetheless, it is imperative to acknowledge the limitations intrinsic to the use of Twitter data. These limitations encompass issues such as regional bias, limited representativeness, and the formidable challenge of distinguishing authentic user opinions from automated or bot-generated content. Among these constraints, regional bias and inadequacies in data mining loom as the principal impediments to contemporary user evaluation research. In pursuit of heightened research accuracy and representativeness, this paper embarks on an exploration of the perspectives held by a substantial user community within the Twitter application pertaining to ride-sharing, drawing upon data from the USA and India. This endeavor encompasses an in-depth analysis of data features utilizing both Latent Dirichlet Allocation (LDA) and BERT. The contributions of this paper can be encapsulated as follows:

1. A comprehensive analysis of data spanning diverse regions, thereby facilitating the elucidation of varying attitudes towards carpooling. This comparative approach serves to mitigate biases stemming from regional disparities.

2. This paper introduces a novel text analysis method based on graphs. The method enhances the robustness of text data by labeling data outside the normal distribution, thereby improving the accuracy of text feature extraction.

3. The implications of this research extend to the potential optimization of internal operational strategies for ridesharing companies. It also offers the prospect of tailoring distinct timing plans for passengers based on regional nuances, thereby bolstering the competitiveness of these firms.

Conclusions

This paper introduces a comprehensive methodology for modeling ridesharing trip time-related topics through LDA and for sentiment analysis employing BERT and multi-logit models. From the initial dataset of tweets, four principal topics are extracted, encompassing wait time, time cost, trip timing, and the impact of the pandemic. The study investigates variations in the distribution of ridesharing trip time topics among gender, age, and country categories both before and after the onset of the pandemic. Furthermore, it employs the BERT model to extract sentiment from each tweet within each group, showcasing the model's robust performance in time series sentiment analysis.

The study undertakes an in-depth analysis of the significance and interrelationships between sentiment and multiple variables. It further devises a sentiment regression model using the multi-logit approach to pinpoint the primary factors exerting influence on sentiment. The key findings can be summarized as follows:

1) The analysis reveals a pronounced emphasis on the topic of trip timing across all demographic groups. Additionally, there is an elevated interest in the wait time topic during the pandemic period, implying that passengers are placing greater importance on trip timing, particularly during specific periods such as mornings. This heightened focus on wait time may be attributed to extended waiting periods stemming from pandemic-related factors, resulting in heightened passenger concerns.

2) Both the USA and India exhibit similar topic distributions before and during the pandemic. Notably, male users express more concern regarding wait time compared to their female counterparts. Before the pandemic, the predominant topics for males and females are wait time and trip timing, respectively. However, during the pandemic, females display increased concern about time cost. Among younger age groups, trip timing takes precedence before the pandemic, while time cost becomes a more prominent consideration afterward. In contrast, the older age group exhibits a notable interest in wait time before the pandemic, but this concern diminishes during the pandemic. Instead, they display heightened concern for time cost and trip timing amid the pandemic.

3) Significant disparities emerge in sentiment between the pre-pandemic and pandemic periods, as well as among different countries. Customers exhibit a more positive sentiment during the pandemic, with passengers in the USA displaying a particularly favorable attitude towards ridesharing trip time. Gender and country-based distinctions are also evident in sentiment, with females and American passengers demonstrating more positive sentiment before the pandemic. However, during the pandemic, no significant differences are observed within each group. Notably, significant variations in sentiment are detected among older passengers and American passengers when comparing pre-pandemic and pandemic sentiments. These groups exhibit an increase in positive sentiment amid the pandemic, underscoring the diverse impact of the pandemic on individuals.

4) The regression model identifies the pandemic and country of origin as the primary factors influencing sentiment. During the pandemic, sentiment tends to skew more positive, with passengers in the USA displaying a notably more positive sentiment compared to their counterparts in India.

This study contributes a method for ridesharing trip time topic modeling and sentiment analysis, considering topic occurrence, trend changes, and sentiment time series variables, which have been largely unexplored in previous research. The methodology employed in this study has undergone enhancements, particularly in the realms of text feature extraction and sentiment analysis. The incorporation of graph-based techniques has proven effective in mitigating issues such as theme extraction inaccuracies and the limitations in sentiment analysis stemming from inadequate feature extraction, as observed in previous methodologies. Nevertheless, it is essential to acknowledge that the current method has its limitations in analyzing various emotions comprehensively and in revealing the nuanced layers of user sentiments toward ride-hailing services. To address these limitations and achieve more in-depth insights, further refinement and optimization of the method framework will be imperative in future research endeavors.

In light of our research outcomes, this paper offers the following recommendations. During morning and peak hours, careful consideration should be given to the adjustment of waiting times. Implementing discounts or appointment-based systems can mitigate user discontentment arising from prolonged waiting periods. Tailored strategies should be devised for users of varying genders. For instance, since men tend to be more concerned about waiting times, providing emotional support, such as discount coupons, can enhance their satisfaction. Acknowledge regional disparities in waiting times. Introducing distinct online ridesharing services tailored to specific regions can be a strategic approach to address varying user needs and preferences.

The contributions of this paper are as follows. First, this paper conducts a thorough examination of data from diverse geographic regions, enabling us to discern the varied attitudes of individuals in these regions toward carpooling through comparative analysis. This approach effectively mitigates bias issues stemming from regional disparities. Then, introducing a novel method for text feature extraction, this paper successfully eliminates the impact of aberrant data on result analysis, thereby enhancing the accuracy of user sentiment assessment. Finally, the insights and findings presented in this paper hold practical value for carpooling companies. They can be employed to inform adjustments in internal operational strategies and the formulation of distinct timing plans tailored to passengers in different regions, ultimately augmenting the competitiveness of these companies.

One notable limitation of this study pertains to the scope of research subjects, specifically focusing solely on users from the USA and India. Consequently, the dataset remains relatively limited in its volume. Additionally, the study primarily delved into a subset of user emotions. Integrating sentiment analysis with topic modeling and incorporating a more extensive emotional analysis could significantly enhance the granularity of sentiment examination. Future research endeavors could benefit from exploring additional variables, including but not limited to sadness, happiness, and excitement, to further enrich the landscape of sentiment analysis.

In future research, we plan to expand our dataset by extracting additional variables from Twitter texts, aiming to augment the depth and value of our text analysis. This includes gathering insights and suggestions regarding pricing and service quality. We will provide further details and descriptions pertaining to these enhancements in our forthcoming research endeavors.

Data type

Description

Users' characteristics

Gender, age, user name, user ID, followers.

Timestamp

The timestamp of each tweet publishes.

Location

The county and location of the user.

The content of the tweet, the situation of the tweet (rewrite or not).

Sample of the tweet before and after the filter

Before filter: @Uber### I like

and miss # uberpull much, prices are odeeeeeeeeer cheaper #uber. https://t.co/OOLOYLexyC
After filter: I like and miss uberpool, these prices are cheaper.

Item

Label

Content

Description

Ridesharing trip time

Wait time

Wait time for the car

Time cost

The time cost from entering the car to ending the trip

Trip happen time

Trip time of day

Pandemic

Topic related to pandemic

Step

Items

Stad. E.

Wald

Freedom

Sig.

Exp(B)

95% CI

Intercept

−0.48

0.15

10.19

1.00

0.00

Pandemic

0.22

0.11

3.67

1.00

0.03

1.01

0.89

1.13

Gender

0.04

0.06

0.55

1.00

0.89

1.04

0.93

1.18

Age

−0.77

0.09

0.75

1.00

0.79

0.92

0.77

1.10

Country

0.33

0.12

4.45

1.00

0.02

1.12

1.01

1.23

Intercept

−0.49

0.16

9.65

1.00

0.02

−

Pandemic

0.12

0.13

4.66

1.00

0.03

1.05

1.01

1.15

Gender

0.01

0.12

0.01

1.00

0.97

1.00

0.79

1.27

Age

0.09

0.67

2.12

1.00

0.12

1.11

0.96

1.26

Country

−0.18

0.09

3.48

1.00

0.02

1.01

0.99

1.22

[1]	Gehrke SR. 2020. Uber service area expansion in three major American cities. Journal of Transport Geography 86:102752 doi: 10.1016/j.jtrangeo.2020.102752 CrossRef Google Scholar
[2]	Zhang C, Zhu F, Wang X, Sun L, Tang H, et al. 2020. Taxi demand prediction using parallel multi-task learning model. IEEE Transactions on Intelligent Transportation Systems 23(2):794−803 doi: 10.1109/TITS.2020.3015542 CrossRef Google Scholar
[3]	Shaheen S, Totte H, Stocker A. 2018. Future of mobility white paper. UC Berkeley: Institute of Transportation Studies at UC Berkeley. http://dx.doi.org/10.7922/G2WH2N5D
[4]	Barrios JM, Hochberg YV, Yi H. 2023. The cost of convenience: Ridehailing and traffic fatalities. Journal of Operations Management 69:823−55 doi: 10.1002/joom.1221 CrossRef Google Scholar
[5]	Clewlow RR, Mishra GS. 2017. Disruptive transportation: The adoption, utilization, and impacts of ride-hailing in the United States. Institute of Transportation Studies, Working Paper Series qt82w2z91j. Institute of Transportation Studies, UC Davis.
[6]	Botsman R. 2017. Who can you trust?: how technology brought us together–and why it could drive us apart. UK: Penguin.
[7]	Raut A, Bhosale R, Avhad K, Awari M, Jadhav S. 2020. A Survey on: Real time Smart Car Pooling and Ride Sharing System using Android application. International Journal of Research and Analytical Reviews 7(1):593−97 Google Scholar
[8]	Li Y, Chung SH. 2020. Ride-sharing under travel time uncertainty: Robust optimization and clustering approaches. Computers & Industrial Engineering 149:106601 doi: 10.1016/j.cie.2020.106601 CrossRef Google Scholar
[9]	Du J, Rakha HA. 2020. COVID-19 impact on ride-hailing: The Chicago case study. Findings 00:1−7 doi: 10.32866/001c.17838 CrossRef Google Scholar
[10]	Morris EA, Zhou Y, Brown AE, Khan SM, Derochers JL, et al. 2020. Are drivers cool with pool? Driver attitudes towards the shared TNC services UberPool and Lyft Shared Transport Policy 94:123−38 doi: 10.1016/j.tranpol.2020.04.019 CrossRef Google Scholar
[11]	Shah D, Kumaran A, Sen R, Kumaraguru P. Travel Time Estimation Accuracy in Developing Regions: An Empirical Case Study with Uber Data in Delhi-NCR. WWW '19: Companion Proceedings of The 2019 World Wide Web Conference, San Francisco USA, May 13−17, 2019. New York, United States: Association for Computing Machinery. pp. 130−36. https://doi.org/10.1145/3308560.3317057
[12]	Boyd D, Golder S, Lotan G. 2010. Tweet, tweet, retweet: Conversational aspects of retweeting on twitter. 2010 43^rd Hawaii International Conference on System Sciences, Honolulu, HI, USA, 5-08 January 2010. USA: IEEE. pp. 1−10. https://doi.org/10.1109/HICSS.2010.412
[13]	Pang B, Lee L. 2008. Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval 2(1-2):1−135 doi: 10.1561/1500000011 CrossRef Google Scholar
[14]	Jin L, Mo C, Zhang B, Yu B. 2018. What is the focus of structural reform in China?—comparison of the factor misallocation degree within the manufacturing industry with a unified model Sustainability 10(11):4051 doi: 10.3390/su10114051 CrossRef Google Scholar
[15]	Monchambert G. 2020. Why do (or don’t) people carpool for long distance trips? A discrete choice experiment in France Transportation Research Part A: Policy and Practice 132:911−31 doi: 10.1016/j.tra.2019.12.033 CrossRef Google Scholar
[16]	Ciari F, Axhausen KW. 2012. Choosing carpooling or car sharing as a mode: Swiss stated choice experiments. Proc. 91^st Annual Meeting of the Transportation Research Board (TRB 2012), Washington D.C., 2012. Washington D.C.: Transportation Research Board (TRB). pp 1−23. https://doi.org/10.3929/ethz-b-000091515
[17]	Agatz N, Erera A, Savelsbergh M, Wang X. 2012. Optimization for dynamic ride-sharing: A review. European Journal of Operational Research 223:295−303 doi: 10.1016/j.ejor.2012.05.028 CrossRef Google Scholar
[18]	Adoma AF, Henry NM, Chen W. 2020. Comparative analyses of bert, roberta, distilbert, and xlnet for text-based emotion recognition. Proc. 2020 17^th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP), Chengdu, China, 18−20 December 2020. USA: IEEE. pp. 117−21. https://doi.org/10.1109/ICCWAMTIP51612.2020.9317379
[19]	Wang B, Shao Y, Miao M. 2022. A social welfare estimation of ride-sharing in China: evidence from transaction data analysis of a large online platform. Technological and Economic Development of Economy 28:419−41 doi: 10.3846/tede.2022.16284 CrossRef Google Scholar
[20]	Mcauliffe J, Blei D. 2007. Supervised topic models. Advances in neural information processing systems 20, Princeton, 2007. Princeton: Princeton University. pp. 1−8.
[21]	Tufts C, Polsky D, Volpp KG, Groeneveld PW, Ungar L, et al. 2018. Characterizing tweet volume and content about common health conditions across Pennsylvania: retrospective analysis. JMIR Public Health and Surveillance 4:e10834 doi: 10.2196/10834 CrossRef Google Scholar
[22]	Kaur H, Sharma, D, Ahuja V. 2020. An analysis of ridesharing in India: The case of Uber and Ola. Information and Communication Technology for Sustainable Development, New York, 2020. Florida: CRC Press. pp. 261−75.
[23]	Cramer J, Krueger M, Haruvy E. 2016. The Competitive Effects of the Sharing Economy: How is Uber Changing Taxis? Retrieved from SSRN: https://ssrn.com/abstract=2974894
[24]	Hagen L. 2018. Content analysis of e-petitions with topic modeling: How to train and evaluate LDA models? Information Processing & Management 54:1292−307 doi: 10.1016/j.ipm.2018.05.006 CrossRef Google Scholar
[25]	Karami A, Bennett LS, He X. 2018. Mining public opinion about economic issues: Twitter and the us presidential election. International Journal of Strategic Decision Sciences (IJSDS) 9:18−28 doi: 10.4018/ijsds.2018010102 CrossRef Google Scholar
[26]	Karami A, Shaw G. 2019. An exploratory study of (#) exercise in the Twittersphere. iConference 2019 Proceedings, North Carolina, 2019. North Carolina: University of North Carolina at Charlotte. https://doi.org/10.21900/iconf.2019.103327
[27]	Karami A, Webb F, Kitzie VL. 2018. Characterizing transgender health issues in twitter. Proceedings of the Association for Information Science and Technology 55:207−15 doi: 10.1002/pra2.2018.14505501023 CrossRef Google Scholar
[28]	Pournarakis DE, Sotiropoulos DN, Giaglis GM. 2017. A computational model for mining consumer perceptions in social media. Decision Support Systems 93:98−110 doi: 10.1016/j.dss.2016.09.018 CrossRef Google Scholar
[29]	Collins M, Karami A. 2018. Social Media Analysis for Organizations: US Northeastern Public And State Libraries Case Study. Proceedings of the Southern Association for Information Systems Conference, Atlanta, GA, USA, 23–24 March, 2018. https://aiselaisnet.org/sais2018/30, www.semanticscholar.org/reader/3e893eb31105f0fdaee485ed11de8a0b87aff9c6
[30]	Sun C, Huang L, Qiu X. 2019. Utilizing BERT for aspect-based sentiment analysis via constructing auxiliary sentence. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, Minnesota, 2019. USA: Association for Computational Linguistics. pp. 380–85. https://doi.org/10.18653/v1/N19-1035
[31]	Blei DM, Ng AY, Jordan MI. 2003. Latent dirichlet allocation. Journal of Machine Learning Research 3:993−1022 Google Scholar
[32]	Aqlan WMM, Ali GA, Rajab K, Rajab A, Shaikh A, et al. 2023. Thalassemia screening by sentiment analysis on social media platform Twitter. Computers, Materials & Continua 76:665−86 doi: 10.32604/cmc.2023.039228 CrossRef Google Scholar
[33]	Qi Y, Shabrina Z. 2023. Sentiment analysis using Twitter data: a comparative application of lexicon-and machine-learning-based approach. Social Network Analysis and Mining 13:31 doi: 10.1007/s13278-023-01030-x CrossRef Google Scholar

Contents

Policies

Services

Partnerships

An analysis of ridesharing trip time using advanced text mining techniques

Abstract

Introduction

Research objective

Literature review

Materials and methods

Data collection and filtering

Topic modeling of ridesharing service

Time-related tweet extraction

Topic labeling based on keyword combinations

Sentiment analysis

VADER model

BERT model

Multi-logit regression

Results

Topic modeling performances and results

Keyword distribution and results

Topic modeling and development trend analysis

Sentiment analysis of ridesharing services

Sensitive and significant analysis

Ridesharing trip time sentiment multi-logit regression model

Conclusions

Author contributions

Acknowledgments

Conflict of interest

Rights and permissions

References

About this article

Cite this article

Article Metrics

Access History

Other Articles By Authors