Deployment optimization of laser chargers in self-organizing power transfer internet of things

Junzhe Hu; Chuanwen Luo; Yi Hong; Guopeng Wang; Xin Fan; Deying Li

doi:10.48130/wpt-0025-0004

Recently, the Internet of Things (IoT) has played an important role in many fields. Nevertheless, the fast and uneven energy consumption of IoT Devices (IoTDs) significantly limits the lifetime of IoT networks. One effective solution is to deploy Laser Static Chargers (LSCs) to power IoTDs. However, deploying enough LSCs to cover all IoTDs is very costly. To prolong the lifetime of IoT networks and reduce the deployment cost of LSCs, we first propose a novel IoT network named Self-organizing Power Transfer IoT with Laser Static Chargers (SPTIoT-LSC), in which IoTDs are equipped with laser transmission and reception modules that allow energy transfer between IoTDs, and several LSCs are deployed in the network to charge IoTDs. Based on SPTIoT-LSC, we study the Minimizing Laser Chargers Coverage (MLCC) problem, which aims to minimize the number of LSCs deployed in SPTIoT-LSC while enabling all IoTDs to work continuously, and we prove that it is NP-hard. To solve the problem, we propose two sub-algorithms: the Layered Charging Scheduling Strategy (LCSS) algorithm, which maximizes the working time of IoTDs for a given set of LSCs and their positions, and the Deploy Chargers based on the Multi-agent deep deterministic policy gradient (DCM) algorithm, which deploys a given number of LSCs in SPTIoT-LSC. Based on these sub-algorithms, we propose an approximation algorithm to solve the MLCC problem. Finally, extensive experiments are conducted to verify the efficiency of the proposed algorithm and the superiority of SPTIoT-LSC.

Related works

This section reviews relevant research and highlights the differences between previous literature and this paper. We classify the investigated problems into two types: wireless charging in IoT, and deployment optimization of static chargers.

Wireless charging in IoT
Commonly used wireless charging methods include magnetic resonant coupling, solar charging, and laser charging. Xie et al.^[8] used a wireless charging vehicle with magnetic resonant coupling to periodically recharge the sensors in a wireless sensor network. However, magnetic resonant coupling has a limited charging distance, which leads to high deployment cost and low charging efficiency. In previous studies^[9−11], solar charging was used to prolong the lifetime of IoTDs. However, solar charging is weather-dependent: IoTDs cannot be charged in bad weather or at night. Laser charging, in contrast, offers a reliable and stable energy supply that is less susceptible to weather variations. The attenuation of a laser beam in air is very small^[12], which makes laser charging an effective long-distance charging method. In previous studies^[13,14], laser charging technology was used to replenish energy for electric vehicles, overcoming drawbacks such as long charging times and short charging ranges. Fu et al.^[7] used an Unmanned Aerial Vehicle (UAV) to collect data from IoTDs and charge them using laser charging technology. By optimizing the UAV's trajectory, they maximized its residual energy while meeting the energy requirements of all IoTDs. Zhang et al.^[15] used laser-beamed Wireless Power Transfer (WPT) technology and presented a multitier tile grid-based spatial structure to charge Aerial User Equipment (AUE) in a high-altitude platform.

Deployment optimization of static chargers
A large body of literature is dedicated to minimizing the deployment cost of static chargers in IoT networks while meeting the energy requirements of IoT devices. Liao & Jiang^[16] studied the problem of minimizing the number of chargers deployed on grid points at a fixed height so that a Wireless Rechargeable Sensor Network (WRSN) is sustainable, and presented two greedy algorithms to solve it. Chen & Jiang^[17] proposed a Particle Swarm Charger Deployment (PSCD) algorithm, which deploys chargers by using the Particle Swarm Optimization (PSO) method. Their results indicated its superiority over the two greedy algorithms proposed by Liao & Jiang^[16]. Chien et al.^[18] presented a layoff algorithm combined with a Simulated Annealing (SA) based method to optimize the chargers' positions in the WRSN. According to their simulation results, the proposed algorithm reduces the number of chargers efficiently. Li et al.^[19] studied how to efficiently deploy wireless static chargers in UAV networks. They presented a binary integer programming method to determine the minimum number of wireless static chargers, as well as the location of each charger, so that the energy requirements of UAVs are satisfied during flight. Liu et al.^[20] investigated the problem of using static chargers to charge nodes with uncertain mobility, and proposed a genetic-algorithm-based multi-objective optimization scheme to address the charger deployment problem. Lin et al.^[21] proposed a hybrid search and removal strategy to find the minimum number of chargers required to cover all sensor nodes. You et al.^[22] studied the fundamental issue of wireless charger placement with obstacles and proposed a greedy algorithm for placing chargers. In summary, many articles study the optimization of static charger deployment.
However, the aforementioned studies neglect energy transfer between devices, so more chargers must be deployed to cover all IoTDs, which is more costly.

The discussion above shows that there are many studies on wireless charging in IoT and on the deployment optimization of static chargers. However, none of them considers energy transfer between IoTDs. Inspired by the above research, this paper investigates the Minimizing Laser Chargers Coverage (MLCC) problem in SPTIoT networks by jointly considering laser charging technology, deployment optimization of chargers, and energy transfer between IoTDs, with the goal of minimizing the number of LSCs deployed in SPTIoT while satisfying the energy requirements of all IoTDs.

Models and definition

In this section, we first introduce the network model. Then, we give the laser charging model for charging IoTDs. Finally, the formal definition of our problem is presented.

Network model
In this paper, we consider the network architecture of SPTIoT with Laser Static Chargers (SPTIoT-LSC), where a set of n IoTDs S = {s₁, s₂, ···, s_n} is randomly located in a two-dimensional square detection area $ A\subseteq {\mathfrak{R}}^{2} $ for a monitoring mission, and several LSCs are deployed in the area to replenish energy for IoTDs. In the network, each IoTD $ {s}_{i}\in S $ has a two-dimensional coordinate (x_i, y_i). All IoTDs have the same battery capacity E_M and energy threshold E_T. An IoTD stops working when its remaining energy falls below E_T. Since IoTDs are in different positions and may take on different tasks in the mission, each IoTD $ {s}_{i}\in S $ has its own energy consumption rate δ_i.

To satisfy the charging requirements of the network, multiple LSCs, collected in the set C = {c₁, c₂, ···, c_m}, are deployed in the detection area to replenish energy for IoTDs. For simplicity, we assume that IoTDs and LSCs have the same laser transmission distance R_c. Any IoTD $ {s}_{i}\in S $ can be charged by $ {c}_{j}\in C $ if and only if $ {d}_{i,j}^{c}\le {R}_{c} $, where $ {d}_{i,j}^{c} $ denotes the Euclidean distance between s_i and c_j. Similarly, any IoTD $ {s}_{i}\in S $ can be charged by $ {s}_{k}\in S $ if and only if $ {d}_{i,k}^{s}\le {R}_{c} $, where $ {d}_{i,k}^{s} $ denotes the Euclidean distance between s_i and s_k. Let $ {\mathcal{N}}_{i}^{s}=\{{s}_{k}\mid k\ne i,\ {d}_{i,k}^{s}\le {R}_{c}\} $ represent the set of neighbor IoTDs of $ {s}_{i}\in S $.

Furthermore, let T_w represent the working period during which all IoTDs must remain continuously operational in SPTIoT-LSC. We discretize T_w evenly into $ \Gamma $ time slots, each of duration $ l=\dfrac{{T}_{w}}{\Gamma } $. The set of time slots is denoted as $ \mathcal{T}=\{1,\;2,\;\cdots,\;\Gamma \} $. For any IoTD $ {s}_{i}\in S $, we use $ {E}_{i}^{r}\left(\tau \right) $ to denote its remaining energy at time slot $ \tau $, with the initial energy set as $ {E}_{i}^{r}\left(0\right)={E}_{M} $.
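As an illustration of the network model, the following minimal Python sketch computes, for every IoTD, its set of neighbor IoTDs and the set of LSCs within range. The function name `neighbor_sets`, the zero-based indices, and the toy coordinates are assumptions made for this example only and are not taken from the paper.

```python
import math


def neighbor_sets(iotds, chargers, r_c):
    """Return (n_s, n_c) for a list of IoTD and LSC coordinates.

    n_s[i] holds the indices k != i of IoTDs with d(s_i, s_k) <= r_c.
    n_c[i] holds the indices j of LSCs with d(s_i, c_j) <= r_c.
    """
    n_s, n_c = [], []
    for i, (xi, yi) in enumerate(iotds):
        n_s.append({k for k, (xk, yk) in enumerate(iotds)
                    if k != i and math.hypot(xi - xk, yi - yk) <= r_c})
        n_c.append({j for j, (xj, yj) in enumerate(chargers)
                    if math.hypot(xi - xj, yi - yj) <= r_c})
    return n_s, n_c


# Toy instance: all numbers are illustrative, not from the paper.
iotds = [(0.0, 0.0), (3.0, 4.0), (20.0, 0.0)]
chargers = [(0.0, 5.0)]
n_s, n_c = neighbor_sets(iotds, chargers, r_c=5.0)

# Slot length l = T_w / Gamma for an assumed 100 s period and 100 slots.
t_w, gamma = 100.0, 100
slot_len = t_w / gamma
```

In this toy instance the third IoTD has neither a neighbor nor an LSC in range, so it could only survive on its initial energy E_M.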

Laser charging model

We consider a one-to-one laser charging model. We use a linear laser energy harvesting model^[23,24] with efficiency ω to describe the energy transmission link from c_j to s_i. The power that s_i receives from c_j can be expressed as:

$ {P}_{c}^{j,i}=\begin{cases}\dfrac{\left(1-{\delta }_{s}\right){\eta }_{el}\,\omega {A}_{c}\chi \,{e}^{-\alpha {d}_{i,j}^{c}}}{{\left(D+{d}_{i,j}^{c}{\Delta }_{\theta }\right)}^{2}}{P}_{c}, & {d}_{i,j}^{c}\le {R}_{c}\\ 0, & {d}_{i,j}^{c} > {R}_{c}\end{cases} $

(1)

where P_c is the source power of an LSC, δ_s is the power splitting factor that separates the energy transfer link from the communication link, $ {\eta }_{el} $ is the electricity-to-laser conversion efficiency, A_c is the area of the charging panel of each IoTD, $ \chi $ is the combined optical efficiency of the transmitter and receiver, α is the laser attenuation coefficient, $ D $ denotes the size of the initial laser beam, and $ {\Delta }_{\theta } $ is the angular spread of the laser beam.

Similar to the laser charging model from LSC to IoTD, for any pair of $ {s}_{i}\in S $ and $ {s}_{k}\in S $, we can obtain the power of s_i received from s_k as:

$ {P}_{s}^{k,i}=\begin{cases}\dfrac{\left(1-{\delta }_{s}\right){\eta }_{el}\,\omega {A}_{c}\chi \,{e}^{-\alpha {d}_{i,k}^{s}}}{{\left(D+{d}_{i,k}^{s}{\Delta }_{\theta }\right)}^{2}}{P}_{s}, & {d}_{i,k}^{s}\le {R}_{c}\\ 0, & {d}_{i,k}^{s} > {R}_{c}\end{cases} $

(2)

where P_s is the source power of an IoTD.
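Equations (1) and (2) share the same form and differ only in the source power, so a single helper can evaluate both. The sketch below is a minimal illustration. The default values of δ_s, η_el, ωA_cχ, α, D and Δ_θ are the ones listed in the parameter table of this article, whereas the function name `received_power`, the source power of 100 W and the range of 50 m are hypothetical values chosen only for the example.

```python
import math


def received_power(p_src, d, r_c, delta_s=1e-5, eta_el=0.3,
                   omega_a_chi=0.004, alpha=1e-6, beam_d=0.1,
                   delta_theta=3.4e-5):
    """Received power at distance d, following the form of Eqs. (1) and (2).

    p_src is P_c for an LSC or P_s for an IoTD. omega_a_chi is the
    product omega * A_c * chi, and beam_d is the initial beam size D.
    """
    if d > r_c:
        return 0.0  # outside the laser transmission distance R_c
    gain = (1.0 - delta_s) * eta_el * omega_a_chi * math.exp(-alpha * d)
    return gain / (beam_d + d * delta_theta) ** 2 * p_src


# Hypothetical source power (100 W) and range (50 m), for illustration only.
p_near = received_power(100.0, 10.0, r_c=50.0)
p_far = received_power(100.0, 40.0, r_c=50.0)
p_out = received_power(100.0, 60.0, r_c=50.0)
```

With these parameter values the beam spread term d·Δ_θ stays small compared with D, so the received power decays only slowly with distance inside the range R_c and drops to zero beyond it.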

Problem definition

In this subsection, we investigate the Minimizing Laser Chargers Coverage (MLCC) problem, whose goal is to minimize the number of LSCs deployed into the detection area while ensuring the energy of each IoTD in S is greater than or equal to E_T during the working period T_w.

We first define the binary variables $ {a}_{i}^{k}\left(\tau \right) $ and $ {b}_{i}^{j}\left(\tau \right) $ as follows.

$ {a}_{i}^{k}\left(\tau \right)=\begin{cases}1, & {s}_{i}\text{ is charged by }{s}_{k}\text{ at time slot }\tau \\ 0, & \text{otherwise}\end{cases} $

(3)

$ {b}_{i}^{j}\left(\tau \right)=\begin{cases}1, & {s}_{i}\text{ is charged by }{c}_{j}\text{ at time slot }\tau \\ 0, & \text{otherwise}\end{cases} $

(4)

Then, we give the expression for the remaining energy of $ {s}_{i}\in S $ at any time slot $ \tau > 0 $:

$ \begin{array}{c}{E}_{i}^{r}\left(\tau \right)={E}_{i}^{r}\left(\tau -1\right)+\sum _{k=1}^{n} {a}_{i}^{k}\left(\tau \right){P}_{s}^{k{{,}}i}l+\sum _{j=1}^{m} {b}_{i}^{j}\left(\tau \right){P}_{c}^{j{{,}}i}l-{\delta }_{i}l-\sum _{v=1}^{n} {a}_{v}^{i}\left(\tau \right){P}_{s}l \end{array} $

(5)
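The energy update in Eq. (5) can be sketched in a few lines of Python. The function name `remaining_energy`, the nested-list layout of the variables, and all numbers in the toy instance are assumptions made for illustration. Indices are zero-based here, whereas the paper counts from 1.

```python
def remaining_energy(e_prev, i, a, b, p_s_recv, p_c_recv, delta, p_s, slot_len):
    """Remaining energy of IoTD i after one time slot, following Eq. (5).

    a[i][k] = 1 if s_i is charged by s_k in this slot, else 0.
    b[i][j] = 1 if s_i is charged by c_j in this slot, else 0.
    p_s_recv[k][i] is the power s_i receives from s_k (Eq. (2)).
    p_c_recv[j][i] is the power s_i receives from c_j (Eq. (1)).
    """
    n, m = len(a), len(p_c_recv)
    from_iotds = sum(a[i][k] * p_s_recv[k][i] for k in range(n))
    from_lscs = sum(b[i][j] * p_c_recv[j][i] for j in range(m))
    # Energy s_i spends on charging other IoTDs at source power P_s.
    sent = sum(a[v][i] for v in range(n)) * p_s
    return e_prev[i] + (from_iotds + from_lscs - delta[i] - sent) * slot_len


# Toy slot: c_0 charges s_0, and s_0 charges s_1 (all numbers illustrative).
a = [[0, 0], [1, 0]]
b = [[1], [0]]
p_s_recv = [[0.0, 2.0], [2.0, 0.0]]
p_c_recv = [[10.0, 0.0]]
e_prev = [100.0, 100.0]
delta = [5.0, 8.0]
e0 = remaining_energy(e_prev, 0, a, b, p_s_recv, p_c_recv, delta, 4.0, 1.0)
e1 = remaining_energy(e_prev, 1, a, b, p_s_recv, p_c_recv, delta, 4.0, 1.0)
```

In this toy slot s_0 gains 10 J from the LSC, consumes 5 J and spends 4 J on charging s_1, while s_1 receives only 2 J of those 4 J because of the transmission loss modeled in Eq. (2).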

Finally, the MLCC problem can be mathematically formulated as:

$ \mathbb{P} :\mathrm{min}\;\;m $

(6)

s.t.

$ \begin{array}{l}\text{C1}:\; {E}_{i}^{r}\left(\tau \right)\ge {E}_{T},\;\forall {s}_{i}\in S,\;\forall \tau \in \mathcal{T}\\ \text{C2}:\; \sum _{k=1}^{n}{a}_{i}^{k}\left(\tau \right)+\sum _{j=1}^{m}{b}_{i}^{j}\left(\tau \right)\le 1,\;\forall {s}_{i}\in S,\;\forall \tau \in \mathcal{T}\\ \text{C3}:\; \sum _{i=1}^{n}{b}_{i}^{j}\left(\tau \right)\le 1,\;\forall {c}_{j}\in C,\;\forall \tau \in \mathcal{T}\\ \text{C4}:\; \sum _{i=1}^{n}{a}_{i}^{k}\left(\tau \right)\le 1,\;\forall {s}_{k}\in S,\;\forall \tau \in \mathcal{T}\end{array} $

(7)

where constraint C1 ensures that the remaining energy of each IoTD is greater than or equal to E_T at every time slot $ \tau \in \mathcal{T} $, constraint C2 states that an IoTD can be charged by at most one IoTD or one LSC at each time slot $ \tau \in \mathcal{T} $, constraint C3 states that an LSC can charge at most one IoTD at each time slot $ \tau \in \mathcal{T} $, and constraint C4 guarantees that an IoTD can charge at most one other IoTD at each time slot $ \tau \in \mathcal{T} $.
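To make constraints C1 to C4 concrete, the sketch below checks them for a single time slot. The function name `slot_is_feasible` and the nested-list layout (`a[i][k]`, `b[i][j]`) are assumptions made for this example, and the numbers are illustrative.

```python
def slot_is_feasible(a, b, energy, e_t):
    """Check constraints C1 to C4 for one time slot.

    a[i][k] = 1 if s_i is charged by s_k, b[i][j] = 1 if s_i is charged
    by c_j, and energy[i] is the remaining energy of s_i in this slot.
    """
    n = len(a)
    m = len(b[0]) if b and b[0] else 0
    c1 = all(e >= e_t for e in energy)                               # C1
    c2 = all(sum(a[i]) + sum(b[i]) <= 1 for i in range(n))           # C2
    c3 = all(sum(b[i][j] for i in range(n)) <= 1 for j in range(m))  # C3
    c4 = all(sum(a[i][k] for i in range(n)) <= 1 for k in range(n))  # C4
    return c1 and c2 and c3 and c4


# c_0 charges s_0 and s_0 charges s_1: every constraint holds.
ok = slot_is_feasible([[0, 0], [1, 0]], [[1], [0]], [101.0, 94.0], e_t=50.0)
# c_0 charges both IoTDs in the same slot: C3 is violated.
bad = slot_is_feasible([[0, 0], [0, 0]], [[1], [1]], [101.0, 94.0], e_t=50.0)
```

A full schedule is feasible for the MLCC problem only if this check passes for every slot in the working period.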

Theorem 1. The MLCC problem is NP-hard.

Proof. We consider a special case of the MLCC problem (scMLCC) with four conditions: (1) P_s = 0; (2) P_c = +∞; (3) for each $ {s}_{i}\in S $, E_M − δ_iT_w < E_T; (4) LSCs can only be deployed at positions that coincide with IoTDs. Under these four conditions, the objective of the scMLCC problem is to minimize the number of LSCs, which can only be deployed at locations coinciding with IoTDs, needed to cover all IoTDs and satisfy their charging requirements.

The decision version of the scMLCC problem, called K-scMLCC, is: given a set of IoTDs S and a positive integer K, does there exist a positive integer m (m ≤ K) such that m LSCs deployed at the locations of m IoTDs can cover all IoTDs?

To prove that the scMLCC problem is NP-hard, we use a reduction from the Minimum Dominating Set (MDS) problem, which has been proven NP-hard^[25]. The decision version of MDS, called J-MDS, is defined as: given an undirected graph G(V, E) and a positive integer J, does there exist a subset $ {V'}\subseteq V $ with $ \left|{V'}\right|\le J $ such that for each node $ u\in V\setminus {V'} $ there is at least one node $ v\in {V'} $ with $ \left(u,v\right)\in E $?

Given an instance of J-MDS with G(V, E), V = {v₁, ···, v_n} and a positive integer J, we construct an instance of the K-scMLCC problem as follows:

● K = J;

● |S| = |V|;

● each $ {s}_{i}\in S $ corresponds to node $ {v}_{i}\in V $ on graph G;

● any pair of s_i, $ {s}_{j}\in S $ are neighbors iff edge $ ({v}_{i}{{,}}{v}_{j})\in E $.

In the following, we prove that J-MDS has a YES answer if and only if K-scMLCC has a YES answer.

'Sufficiency'. Suppose m (m ≤ K) LSCs, whose locations coincide with $ m $ IoTDs, can cover all IoTDs. Let $ {S'}\subseteq S $ be the set of these m IoTDs and $ {V'}\subseteq V $ be the set of nodes in graph G corresponding to the IoTDs in S'. Every IoTD outside S' is covered by an LSC located at some IoTD in S' and is therefore a neighbor of that IoTD, so $ {V'} $ is a dominating set in G and |V'| = |S'| = m ≤ K = J.

'Necessity'. Suppose $ {V'}\subseteq V $ is a dominating set in G with $ \left|{V'}\right|\le J $. Let $ {S'}\subseteq S $ be the set of IoTDs corresponding to the nodes in $ {V'} $ in graph $ G $. Then any $ {s}_{i}\in S\setminus {S'} $ has at least one neighbor in S'. Hence m (m = |S'|) LSCs deployed at the locations of the IoTDs in S' can cover all IoTDs, and m = |S'| = |V'| ≤ J = K.

Therefore, the scMLCC problem is NP-hard. Since the scMLCC problem is a special case of the MLCC problem, the MLCC problem is also NP-hard.
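The reduction can be illustrated on a small graph. The sketch below builds the neighbor relation of a K-scMLCC instance from the edges of a graph and then finds the smallest set of IoTD locations whose LSCs cover all IoTDs by exhaustive search. The function names are hypothetical, and the brute-force search is only a demonstration that runs in exponential time, which is in line with the NP-hardness result. It is not an algorithm from the paper.

```python
from itertools import combinations


def instance_from_graph(n, edges):
    """IoTDs s_i and s_j are neighbors iff (v_i, v_j) is an edge of G."""
    neighbors = [set() for _ in range(n)]
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    return neighbors


def covers_all(neighbors, placed):
    """True if every IoTD hosts an LSC or has a neighbor hosting one."""
    return all(i in placed or neighbors[i] & placed
               for i in range(len(neighbors)))


def min_lscs_brute_force(neighbors):
    """Smallest number of LSC locations covering all IoTDs."""
    n = len(neighbors)
    for m in range(1, n + 1):
        for placed in combinations(range(n), m):
            if covers_all(neighbors, set(placed)):
                return m, set(placed)
    return n, set(range(n))


# Path graph v0 - v1 - v2 - v3 - v4 (illustrative instance).
neighbors = instance_from_graph(5, [(0, 1), (1, 2), (2, 3), (3, 4)])
m_min, placed = min_lscs_brute_force(neighbors)
```

On this path graph no single node dominates all five nodes, so the minimum dominating set and the minimum number of LSCs both have size 2.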

[1]	Gokhale P, Bhat O, Bhat S. 2018. Introduction to IOT. International Advanced Research Journal in Science, Engineering and Technology 5(1):41−44
[2]	Ma W, Yang X, Tian Z. 2024. Agriculture neutralization: perspective from intelligent agricultural machinery. Circular Agricultural Systems 4:e002 doi: 10.48130/cas-0024-0002
[3]	Ullah Z, Rehman AU, Wang S, Hasanien HM, Luo P, et al. 2023. IoT-based monitoring and control of substations and smart grids with renewables and electric vehicles integration. Energy 282:128924 doi: 10.1016/j.energy.2023.128924
[4]	Palazzari V, Mezzanotte P, Alimenti F, Fratini F, Orecchini G, et al. 2017. Leaf compatible 'eco-friendly' temperature sensor clip for high density monitoring wireless networks. Wireless Power Transfer 4(1):55−60 doi: 10.1017/wpt.2017.1
[5]	Yang P, Abusafia A, Lakhdari A, Bouguettaya A. 2023. Monitoring efficiency of iot wireless charging. 2023 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops), 13−17 March 2023, Atlanta, GA, USA. USA: IEEE. pp. 306−8. doi: 10.1109/PerComWorkshops56833.2023.10150276
[6]	Xing Y, Pan H, Xu B, Tapparello C, Shi W, et al. 2021. Optimal Wireless Information and Power Transfer Using Deep Q-Network. Wireless Power Transfer 8:5513509 doi: 10.1155/2021/5513509
[7]	Fu Y, Mei H, Wang K, Yang K. 2021. Joint optimization of 3D trajectory and scheduling for solar-powered UAV systems. IEEE Transactions on Vehicular Technology 70(4):3972−77 doi: 10.1109/TVT.2021.3063310
[8]	Xie L, Shi Y, Hou YT, Lou W, Sherali HD, et al. 2014. Rechargeable sensor networks with magnetic resonant coupling. In Rechargeable Sensor Networks: Technology, Theory, and Application, eds. Chen J, He S, Sun Y. World Scientific. pp. 31−68. doi: 10.1142/9789814525466_0002
[9]	Sharma H, Haque A, Jaffery ZA. 2018. Solar energy harvesting wireless sensor network nodes: A survey. Journal of Renewable and Sustainable Energy 10(2):023704 doi: 10.1063/1.500661
[10]	Sharma H, Haque A, Jaffery ZA. 2018. An efficient solar energy harvesting system for wireless sensor nodes. 2018 2nd IEEE International Conference on Power Electronics, Intelligent Control and Energy Systems (ICPEICES), 22−24 October 2018, Delhi, India. USA: IEEE. pp. 461−64. doi: 10.1109/ICPEICES.2018.8897434
[11]	Ram SK, Chourasia S, Das BB, Swain AK, Mahapatra K, et al. 2020. A solar based power module for battery-less IoT sensors towards sustainable smart cities. 2020 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 6−8 July 2020, Limassol, Cyprus. USA: IEEE. pp. 458−63. doi: 10.1109/ISVLSI49217.2020.00-14
[12]	Yang J, Zhu K, Zhu X, Wang J. 2021. Learning-based aerial charging scheduling for UAV-based data collection. Wireless Algorithms, Systems, and Applications: 16th International Conference, WASA 2021, Nanjing, China, June 25–27, 2021, Proceedings, Part II 16. Cham: Springer International Publishing. pp. 600−11. doi: 10.1007/978-3-030-86130-8_47
[13]	Rathod Y, Hughes L. 2019. Simulating the charging of electric vehicles by laser. Procedia Computer Science 155:527−34 doi: 10.1016/j.procs.2019.08.073
[14]	Luo C, Liu N, Hou Y, Hong Y, Chen Z, et al. 2023. Trajectory optimization of laser-charged UAV to minimize the average age of information for wireless rechargeable sensor network. Theoretical Computer Science 945:113680 doi: 10.1016/j.tcs.2022.12.030
[15]	Zhang L, Wang Y, Min M, Guo C, Sharma V, et al. 2023. Privacy-aware laser wireless power transfer for aerial multi-access edge computing: a colonel blotto game approach. IEEE Internet of Things Journal 10(7):5923−39 doi: 10.1109/JIOT.2022.3167052
[16]	Liao JH, Jiang JR. 2014. Wireless charger deployment optimization for wireless rechargeable sensor networks. 2014 7th International Conference on Ubi-Media Computing and Workshops, 12−14 July 2014, Ulaanbaatar, Mongolia. USA: IEEE. pp. 160−64. doi: 10.1109/U-MEDIA.2014.72
[17]	Chen YC, Jiang JR. 2016. Particle swarm optimization for charger deployment in wireless rechargeable sensor networks. 2016 26th International Telecommunication Networks and Applications Conference (ITNAC), 7−9 December 2016, Dunedin, New Zealand. USA: IEEE. pp. 231−36. doi: 10.1109/ATNAC.2016.7878814
[18]	Chien WC, Cho HH, Chao HC, Shih TK. 2016. Enhanced SA-based charging algorithm for WRSN. 2016 International Wireless Communications and Mobile Computing Conference (IWCMC), 5−9 September 2016, Paphos, Cyprus. USA: IEEE. pp. 1012−17. doi: 10.1109/IWCMC.2016.7577197
[19]	Li M, Liu L, Wang Y, Peng J, Xi J, et al. 2021. Efficient wireless static chargers deployment for UAV networks. 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), 30 September 2021 − 3 October 2021, New York City, NY, USA. USA: IEEE. pp. 1483−90. doi: 10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00200
[20]	Liu H, Zhong L, Liu Z, Lin F. 2022. A multi-objective genetic optimization algorithm for charger selection in static charger deployment scheme for WRSN. 2022 IEEE 14th International Conference on Advanced Infocomm Technology (ICAIT), 8−11 July 2022, Chongqing, China. USA: IEEE. pp. 230−35. doi: 10.1109/ICAIT56197.2022.9862698
[21]	Lin TL, Chang HY, Wang YH. 2020. A novel hybrid search and remove strategy for power balance wireless charger deployment in wireless rechargeable sensor networks. Energies 13(10):2661 doi: 10.3390/en13102661
[22]	You W, Ren M, Ma Y, Wu D, Yang J, et al. 2023. Practical charger placement scheme for wireless rechargeable sensor networks with obstacles. ACM Transactions on Sensor Networks 20(1):1−23 doi: 10.1145/3614431
[23]	Zhang Q, Fang W, Liu Q, Wu J, Xia P, et al. 2018. Distributed laser charging: A wireless power transfer approach. IEEE Internet of Things Journal 5(5):3853−64 doi: 10.1109/JIOT.2018.2851070
[24]	Lahmeri MA, Kishk MA, Alouini MS. 2020. Stochastic geometry-based analysis of airborne base stations with laser-powered UAVs. IEEE Communications Letters 24(1):173−77 doi: 10.1109/LCOMM.2019.2947039
[25]	Hartmanis J. 1982. Computers and intractability: a guide to the theory of NP-completeness (Michael R. Garey and David S. Johnson). SIAM Review 24(1):90−91 doi: 10.1137/1024022
[26]	Littman ML. 1994. Markov games as a framework for multi-agent reinforcement learning. In Machine Learning Proceedings 1994, eds. Cohen WW, Hirsh H. USA: Morgan Kaufmann. pp. 157−63. doi: 10.1016/B978-1-55860-335-6.50027-1
[27]	Zhang K, Liu Y, Liu J, Liu M, Başar T. 2020. Distributed learning of average belief over networks using sequential observations. Automatica 115:108857 doi: 10.1016/j.automatica.2020.108857
[28]	Lowe R, Wu YI, Tamar A, Harb J, Abbeel P, et al. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. Advances in Neural Information Processing Systems 30 (NIPS 2017), 4−9 December 2017, Long Beach, USA. https://proceedings.neurips.cc/paper_files/paper/2017
[29]	van de Velden M, D’Enza AI, Markos A. 2019. Distance-based clustering of mixed data. Wiley Interdisciplinary Reviews: Computational Statistics 11(3):e1456 doi: 10.1002/wics.1456

Algorithm 1: Layered Charging Scheduling Strategy (LCSS)
Input: The number of LSCs m, the set C of m LSCs, the set S of n IoTDs, the time slot length l, the set $ \mathcal{T} $ of $ \Gamma $ time slots;
Output: T_c;
1:	Initialize T_c = 0;
2:	For each $ {s}_{i}\in S $, compute $ {\mathcal{N}}_{i}^{c}=\{{c}_{j}\mid {d}_{i,j}^{c}\le {R}_{c}\} $;
3:	For each $ {s}_{i}\in S $, compute $ l{y}_{i} $ by using the Dijkstra algorithm;
4:	for $ \tau $ from $ 1 $ to $ \Gamma $ do
5:	    For each $ {s}_{i}\in S $, let $ {E}_{i}^{r}(\tau )={E}_{i}^{r}(\tau -1)-{\delta }_{i}l $;
6:	    Sort all IoTDs and obtain $ {S}_{o}=\{{s}_{{\rho }_{1}},\cdots ,{s}_{{\rho }_{n}}\} $;
7:	    for i from 1 to n do
8:	        Let $ {C}_{{\rho }_{i}}=\{{c}_{j}\mid {c}_{j}\in {\mathcal{N}}_{{\rho }_{i}}^{c}\wedge \sum _{q=1}^{n}{b}_{q}^{j}(\tau )=0\} $;
9:	        Let $ {S}_{{\rho }_{i}}^{u}=\{{s}_{k}\mid {s}_{k}\in {\mathcal{N}}_{{\rho }_{i}}^{s}\wedge l{y}_{k}=l{y}_{{\rho }_{i}}-1\wedge \sum _{q=1}^{n}{a}_{q}^{k}(\tau )=0\wedge {E}_{k}^{r}(\tau )-{P}_{s}l > {E}_{{\rho }_{i}}^{r}(\tau )\} $;
10:	        Let $ {S}_{{\rho }_{i}}^{e}=\{{s}_{k}\mid {s}_{k}\in {\mathcal{N}}_{{\rho }_{i}}^{s}\wedge l{y}_{k}=l{y}_{{\rho }_{i}}\wedge \sum _{q=1}^{n}{a}_{q}^{k}(\tau )=0\wedge {E}_{k}^{r}(\tau )-{P}_{s}l > {E}_{{\rho }_{i}}^{r}(\tau )\} $;
11:	        if $ {C}_{{\rho }_{i}}\ne \varnothing $ then
12:	            $ {c}_{v}=\arg \min \{{d}_{{\rho }_{i},v}^{c}\mid {c}_{v}\in {C}_{{\rho }_{i}}\} $;
13:	            $ {b}_{{\rho }_{i}}^{v}(\tau )=1 $, $ {E}_{{\rho }_{i}}^{r}(\tau )=\min \{{E}_{M},{E}_{{\rho }_{i}}^{r}(\tau )+{P}_{c}^{v,{\rho }_{i}}l\} $;
14:	        else if $ {S}_{{\rho }_{i}}^{u}\ne \varnothing $ then
15:	            $ {s}_{v}=\arg \max \{{E}_{v}^{r}(\tau )\mid {s}_{v}\in {S}_{{\rho }_{i}}^{u}\} $;
16:	            $ {E}_{{\rho }_{i}}^{r}(\tau )={E}_{{\rho }_{i}}^{r}(\tau )+{P}_{s}^{v,{\rho }_{i}}l $;
17:	            $ {a}_{{\rho }_{i}}^{v}(\tau )=1 $, $ {E}_{v}^{r}(\tau )={E}_{v}^{r}(\tau )-{P}_{s}l $;
18:	        else if $ {S}_{{\rho }_{i}}^{e}\ne \varnothing $ then
19:	            $ {s}_{v}=\arg \max \{{E}_{v}^{r}(\tau )\mid {s}_{v}\in {S}_{{\rho }_{i}}^{e}\} $;
20:	            $ {E}_{{\rho }_{i}}^{r}(\tau )={E}_{{\rho }_{i}}^{r}(\tau )+{P}_{s}^{v,{\rho }_{i}}l $;
21:	            $ {a}_{{\rho }_{i}}^{v}(\tau )=1 $, $ {E}_{v}^{r}(\tau )={E}_{v}^{r}(\tau )-{P}_{s}l $;
22:	        end if
23:	        Let $ {S}_{o}={S}_{o}\setminus \{{s}_{{\rho }_{i}}\} $ and re-sort $ {S}_{o}=\{{s}_{{\rho }_{i+1}},{s}_{{\rho }_{i+2}},\cdots ,{s}_{{\rho }_{n}}\} $;
24:	    end for
25:	    if $ \min \{{E}_{i}^{r}(\tau )\mid 1\le i\le n\} < {E}_{T} $ then
26:	        break;
27:	    end if
28:	    $ {T}_{c}=l\tau $;
29:	end for
30:	return T_c;
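Step 3 of the listing above assigns a layer to every IoTD. The excerpt does not state how the layers are defined, so the following sketch rests on an assumption: IoTDs within range of at least one LSC form layer 1, and every other IoTD gets one plus the smallest layer among its neighbors. With unit edge weights, the shortest-path computation that Dijkstra's algorithm performs reduces to a breadth-first search, which is what the sketch uses. The function name `compute_layers` and the toy data are hypothetical.

```python
from collections import deque


def compute_layers(n_s, n_c):
    """Hop-count layer of each IoTD (assumed definition, see lead-in).

    n_s[i] is the set of neighbor IoTDs of s_i and n_c[i] the set of
    LSCs within range of s_i. IoTDs that cannot be reached from any
    LSC-covered IoTD keep the layer None.
    """
    layer = [None] * len(n_s)
    queue = deque()
    for i, lscs in enumerate(n_c):
        if lscs:  # directly chargeable by some LSC
            layer[i] = 1
            queue.append(i)
    while queue:
        i = queue.popleft()
        for k in n_s[i]:
            if layer[k] is None:
                layer[k] = layer[i] + 1
                queue.append(k)
    return layer


# Chain s_0 - s_1 - s_2 with an LSC next to s_0, and an isolated s_3.
n_s = [{1}, {0, 2}, {1}, set()]
n_c = [{0}, set(), set(), set()]
layers = compute_layers(n_s, n_c)
```

Under this reading, the set in step 9 collects neighbors one layer closer to an LSC and the set in step 10 collects neighbors in the same layer, so energy is relayed outward from the LSCs layer by layer.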

Algorithm 2: Deploy Chargers based on the Multi-agent deep deterministic policy gradient (DCM)
Input: The number of episodes N_e, time steps T, a small constant $ \epsilon $, the number of LSCs m, the set S of IoTDs, the number of grids L × L;
Output: Positions of m LSCs, T_c;
1:	for each $ {c}_{j}\in C $ do
2:	    Initialize the parameters $ {\theta }_{j}^{\mu } $ and $ {\theta }_{j}^{{\mu }'} $ of the actor network and its target network;
3:	    Initialize the parameters $ {\theta }_{j}^{Q} $ and $ {\theta }_{j}^{{Q}'} $ of the critic network and its target network;
4:	end for
5:	Initially place m agents in the center of the intermediate grid;
6:	Obtain $ t{f}^{0} $ by calling Algorithm 1;
7:	Clear out the replay buffer $ \mathcal{D} $;
8:	for episode from 1 to N_e do
9:	    Reset the positions of agents;
10:	    Initialize the exploration noise $ \mathcal{N} $;
11:	    for time step t from 1 to T do
12:	        for agent j from 1 to m do
13:	            Get the observation $ {o}_{j}^{t} $;
14:	            Choose action $ {a}_{j}^{t}={\mu }_{j}({o}_{j}^{t};{\theta }_{j}^{\mu })+\mathcal{N} $;
15:	        end for
16:	        Execute joint actions $ {\mathbf{a}}^{t}=\{{a}_{1}^{t},\cdots ,{a}_{m}^{t}\} $ of all agents;
17:	        Execute Algorithm 1 to obtain $ t{f}^{t} $;
18:	        Obtain reward $ {\mathbf{r}}^{t} $ and next observations $ {\mathbf{o}}^{t+1} $;
19:	        Store $ ({\mathbf{o}}^{t},{\mathbf{a}}^{t},{\mathbf{r}}^{t},{\mathbf{o}}^{t+1}) $ in replay buffer $ \mathcal{D} $;
20:	        Select random transitions $ \mathcal{K} $ from $ \mathcal{D} $;
21:	        for agent j from 1 to m do
22:	            Update $ {\theta }_{j}^{\mu } $ and $ {\theta }_{j}^{Q} $ by equations (11) and (12);
23:	            Update $ {\theta }_{j}^{{\mu }'} $ and $ {\theta }_{j}^{{Q}'} $ by equations (13) and (14);
24:	        end for
25:	    end for
26:	end for
27:	Reset the positions of agents;
28:	for time step t from 1 to T do
29:	    for agent j from 1 to m do
30:	        Get the observation $ {o}_{j}^{t} $;
31:	        Obtain the action $ {a}_{j}^{t}={\mu }_{j}({o}_{j}^{t};{\theta }_{j}^{\mu }) $;
32:	        Execute the action $ {a}_{j}^{t} $;
33:	    end for
34:	end for
35:	Execute Algorithm 1 to obtain T_c;
36:	return positions of m agents and T_c;

Algorithm 3: Approximation algorithm for the MLCC problem
Input: The set of IoTDs S;
Output: m;
1:	Divide A evenly into L × L grids;
2:	for m from 1 to n do
3:	    Execute Algorithm 2 to deploy m LSCs and obtain T_c;
4:	    if T_c == T_w then
5:	        break;
6:	    end if
7:	end for
8:	return m;
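The outer search over the number of LSCs can be sketched as follows. The callable `deploy` stands in for Algorithm 2 and is a hypothetical stub here. It is assumed to return the LSC positions together with the achieved working time T_c. Unlike the listing, the sketch returns None when even n LSCs are not enough, which is a design choice of this example.

```python
def min_chargers(n, t_w, deploy):
    """Smallest m in 1..n whose deployment keeps all IoTDs alive for t_w.

    deploy(m) is assumed to return (positions, t_c), where t_c is the
    working time achieved with m LSCs.
    """
    for m in range(1, n + 1):
        positions, t_c = deploy(m)
        if t_c >= t_w:
            return m, positions
    return None


def toy_deploy(m):
    """Stand-in for Algorithm 2: each extra LSC adds 30 s of working time."""
    return [(0.0, 0.0)] * m, min(100.0, 30.0 * m)


result = min_chargers(n=10, t_w=100.0, deploy=toy_deploy)
```

Because m is increased one step at a time, the first m that reaches T_w is the smallest one that the deployment routine can certify. Whether it is the true optimum depends on the quality of the deployment returned for each m.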

Parameter	Value	Parameter	Value
A	[200 m, 200 m]²	N_e	6,000
L	20	T	20
E_T	200 J	$ |\mathcal{D}| $	25,600
$ l $	1 s	$ |\mathcal{K}| $	256
$ {\delta }_{s} $	10⁻⁵	$ \epsilon $	0.005
$ {\eta }_{el} $	0.3	$ \gamma $	0.95
$ \omega {A}_{c}\chi $	0.004 m²	$ \mathcal{N} $	0.1
$ \alpha $	10⁻⁶ m	$ {P}_{b} $	50
$ D $	0.1 m	Layer type	Fully connected
$ {\Delta }_{\theta } $	3.4 × 10⁻⁵	Optimizer	Adam
$ {\delta }_{i} $	[5 W, 20 W]
