<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>Univrsity of Tehran Press</PublisherName>
				<JournalTitle>Industrial Management Journal</JournalTitle>
				<Issn>3115-7386</Issn>
				<Volume>16</Volume>
				<Issue>4</Issue>
				<PubDate PubStatus="epublish">
					<Year>2024</Year>
					<Month>11</Month>
					<Day>21</Day>
				</PubDate>
			</Journal>
<ArticleTitle>Dynamic Pricing of Customer Classes in Rail Transportation Systems Using Deep Q Network Algorithm</ArticleTitle>
<VernacularTitle>قیمت‌گذاری پویای کلاس‌های مشتریان در سیستم‌های حمل‌ونقل ریلی با استفاده از الگوریتم شبکۀ عمیق Q</VernacularTitle>
			<FirstPage>597</FirstPage>
			<LastPage>630</LastPage>
			<ELocationID EIdType="pii">99727</ELocationID>
			
<ELocationID EIdType="doi">10.22059/imj.2024.377050.1008164</ELocationID>
			
			<Language>EN</Language>
<AuthorList>
<Author>
					<FirstName>Omid</FirstName>
					<LastName>Niknami</LastName>
<Affiliation>MSc. Student, Department of Industrial Engineering, Faculty of Industrial and Systems Engineering, Tarbiat Modares University, Tehran, Iran.</Affiliation>

</Author>
<Author>
					<FirstName>Elham</FirstName>
					<LastName>Akhondzadeh Noughabi</LastName>
<Affiliation>Assistant Prof., Department of Information Technology, Faculty of Industrial Engineering and Systems, Tarbiat Modares University, Tehran, Iran.</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
			<History>
				<PubDate PubStatus="received">
					<Year>2024</Year>
					<Month>05</Month>
					<Day>27</Day>
				</PubDate>
			</History>
		<Abstract>&lt;strong&gt;Objective&lt;/strong&gt;
This research investigates the problem of dynamic pricing in rail transportation systems using advanced deep reinforcement learning techniques. The main goal is to optimize the revenue of railway transport companies by developing a ticket sales policy that dynamically adjusts ticket prices based on service classes. This approach allows rail transport companies to enhance revenue and profitability by accurately aligning prices with passenger demand.
 
&lt;strong&gt;Methods&lt;/strong&gt;
To solve the problem of dynamic pricing, this research utilizes the Q deep network algorithm, which combines deep neural networks with Q-learning. Deep neural networks approximate Q values instead of using a costly Q table. The Q deep network algorithm is widely used due to its ability to learn optimal policies in complex environments. As reinforcement learning models are often too complex to analyze, numerical experiments and simulations are used to analyze different pricing strategies.
 
&lt;strong&gt;Results&lt;/strong&gt;
The simulations demonstrate that the Q deep network algorithm successfully converges to a stable pricing policy. Various performance indicators were investigated, including such as total revenue, remaining capacity, average prices offered to customers, and the number of tickets sold in each service class. The algorithm showed improvement in the early stages and gradually achieved stability. The average total revenue converges to 225,000 after 5,000 iterations, indicating that the company earns an average of 225,000 monetary units from each train. The average residual capacity approaches zero after approximately 3,000 iterations, indicating that the reinforcement learning agent learns to sell all available tickets to maximize total revenue. The average price index offered to customers stabilizes after approximately 7,500 iterations, indicating that the algorithm has converged to an optimal pricing policy. In this state, the average prices remain within the range of 680 to 700 monetary units, with no significant fluctuations observed. In other words, the reinforcement learning model has successfully converged based on the average proposed price index.&lt;strong&gt;&lt;em&gt; &lt;/em&gt;&lt;/strong&gt;Finally, after about 5,000 iterations, the average number of tickets sold for all service classes reaches a stable level. The average number of tickets sold for economy class is around 175 to 180 tickets, for business class is around 130 to 135 tickets, for special&lt;strong&gt;&lt;em&gt; &lt;/em&gt;&lt;/strong&gt;class is around 60 to 65 tickets, and for hotel class is around 23 to 25 tickets.
 
&lt;strong&gt;Conclusion&lt;/strong&gt;
The findings of this study suggest that employing the Deep Q-Network algorithm in dynamic pricing can lead to substantial optimization in revenue management for railway transportation systems.&lt;strong&gt; &lt;/strong&gt;The results of this research indicate that after approximately 7,500 iterations, the Q deep network algorithm reaches an optimal and stable policy with no significant changes in performance. It can be concluded that the use of the Q deep network algorithm in dynamic pricing can significantly improve the revenue management of rail transportation systems. This algorithm can learn and adapt to changing conditions, allowing for effective pricing policies to maximize revenue and determine the optimal number of tickets sold in each service class. The obtained findings can help rail transport companies improve pricing strategies and increase economic productivity.</Abstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Dynamic pricing</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Reinforcement Learning</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">rail transportation</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://imj.ut.ac.ir/article_99727_90ae11416a117502b7e35f3ded8510f0.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
