Search
2025 Volume 12
Article Contents
ARTICLE   Open Access    

Deployment optimization of laser chargers in self-organizing power transfer internet of things

More Information
  • Recently, the Internet of Things (IoT) has played an important role in many fields. Nevertheless, the fast and uneven energy consumption of IoT Devices (IoTDs) significantly limits the lifetime of IoT networks. One of the effective solutions is to deploy Laser Static Chargers (LSCs) to power IoTDs. However, deploying LSCs to cover all IoTDs will consume enormous costs. To prolong the lifetime of IoT and reduce the deployment costs of LSCs, in this paper, we first propose a novel IoT network named Self-organizing Power Transfer IoT with Laser Static Chargers (SPTIoT-LSC), where IoTDs are equipped with laser transmission and reception modules allowing energy transfer between IoTDs, and several LSCs are deployed into the network to charge IoTDs. Based on SPTIoT-LSC, we study the Minimizing Laser Chargers Coverage(MLCC) problem, which aims to minimize the number of LSCs deployed in SPTIoT-LSC while enabling all IoTDs to work continuously. Then we prove its NP-hardness. To solve the problem, we propose two sub-algorithms: the Layered Charging Scheduling Strategy (LCSS) algorithm and Deploy Chargers based on the Multi-agent deep deterministic policy gradient (DCM) algorithm to maximize the working time of IoTDs with given LSCs and corresponding positions and deploy given LSCs in SPTIoT-LSC, respectively. Based on the above sub-algorithms, we propose an approximation algorithm to solve the MLCC problem. Finally, extensive experiments are proposed to verify the efficiency of the proposed algorithm and the superiority of SPTIoT-LSC.
  • 加载中
  • [1] Gokhale P, Bhat O, Bhat S. 2018. Introduction to IOT. International Advanced Research Journal in Science, Engineering and Technology 5(1):41−44

    Google Scholar

    [2] Ma W, Yang X, Tian Z. 2024. Agriculture neutralization: perspective from intelligent agricultural machinery. Circular Agricultural Systems 4:e002 doi: 10.48130/cas-0024-0002

    CrossRef   Google Scholar

    [3] Ullah Z, Rehman AU, Wang S, Hasanien HM, Luo P, et al. 2023. IoT-based monitoring and control of substations and smart grids with renewables and electric vehicles integration. Energy 282:128924 doi: 10.1016/j.energy.2023.128924

    CrossRef   Google Scholar

    [4] Palazzari V, Mezzanotte P, Alimenti F, Fratini F, Orecchini G, et al. 2017. Leaf compatible 'eco-friendly' temperature sensor clip for high density monitoring wireless networks. Wireless Power Transfer 4(1):55−60 doi: 10.1017/wpt.2017.1

    CrossRef   Google Scholar

    [5] Yang P, Abusafia A, Lakhdari A, Bouguettaya A. 2023. Monitoring efficiency of iot wireless charging. 2023 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops), 13−17 March 2023, Atlanta, GA, USA. USA: IEEE. pp. 306−8. doi: 10.1109/PerComWorkshops56833.2023.10150276
    [6] Xing Y, Pan H, Xu B, Tapparello C, Shi W, et al. 2021. Optimal Wireless Information and Power Transfer Using Deep Q-Network. Wireless Power Transfer 8:5513509 doi: 10.1155/2021/5513509

    CrossRef   Google Scholar

    [7] Fu Y, Mei H, Wang K, Yang K. 2021. Joint optimization of 3D trajectory and scheduling for solar-powered UAV systems. IEEE Transactions on Vehicular Technology 70(4):3972−77 doi: 10.1109/TVT.2021.3063310

    CrossRef   Google Scholar

    [8] Xie L, Shi Y, Hou YT, Lou W, Sherali HD, et al. 2014. Rechargeable sensor networks with magnetic resonant coupling. In Rechargeable Sensor Networks: Technology, Theory, and Application, eds. Chen J, He S, Sun Y. World Scientific. pp. 31−68. doi: 10.1142/9789814525466_0002
    [9] Sharma H, Haque A, Jaffery ZA. 2018. Solar energy harvesting wireless sensor network nodes: A survey. Journal of Renewable and Sustainable Energy 10(2):023704 doi: 10.1063/1.500661

    CrossRef   Google Scholar

    [10] Sharma H, Haque A, Jaffery ZA. 2018. An efficient solar energy harvesting system for wireless sensor nodes. 2018 2nd IEEE International Conference on Power Electronics, Intelligent Control and Energy Systems (ICPEICES), 22−24 October 2018, Delhi, India. USA: IEEE. pp. 461−64. doi: 10.1109/ICPEICES.2018.8897434
    [11] Ram SK, Chourasia S, Das BB, Swain AK, Mahapatra K, et al. 2020. A solar based power module for battery-less IoT sensors towards sustainable smart cities. 2020 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 6−8 July 2020, Limassol, Cyprus. USA: IEEE. pp. 458−63. doi: 10.1109/ISVLSI49217.2020.00-14
    [12] Yang J, Zhu K, Zhu X, Wang J. 2021. Learning-based aerial charging scheduling for UAV-based data collection. Wireless Algorithms, Systems, and Applications: 16th International Conference, WASA 2021, Nanjing, China, June 25–27, 2021, Proceedings, Part II 16. Cham: Springer International Publishing. pp. 600−11. doi: 10.1007/978-3-030-86130-8_47
    [13] Rathod Y, Hughes L. 2019. Simulating the charging of electric vehicles by laser. Procedia Computer Science 155:527−34 doi: 10.1016/j.procs.2019.08.073

    CrossRef   Google Scholar

    [14] Luo C, Liu N, Hou Y, Hong Y, Chen Z, et al. 2023. Trajectory optimization of laser-charged UAV to minimize the average age of information for wireless rechargeable sensor network. Theoretical Computer Science 2023 945:113680 doi: 10.1016/j.tcs.2022.12.030

    CrossRef   Google Scholar

    [15] Zhang L, Wang Y, Min M, Guo C, Sharma V, et al. 2023. Privacy-aware laser wireless power transfer for aerial multi-access edge computing: a colonel blotto game approach. IEEE Internet of Things Journal 10(7):5923−39 doi: 10.1109/JIOT.2022.3167052

    CrossRef   Google Scholar

    [16] Liao JH, Jiang JR. 2014. Wireless charger deployment optimization for wireless rechargeable sensor networks. 2014 7th International Conference on Ubi-Media Computing and Workshops, 12−14 July 2014, Ulaanbaatar, Mongolia. USA: IEEE. pp. 160−64. doi: 10.1109/U-MEDIA.2014.72
    [17] Chen YC, Jiang JR. 2016. Particle swarm optimization for charger deployment in wireless rechargeable sensor networks. 2016 26th International Telecommunication Networks and Applications Conference (ITNAC), 7−9 December 2016, Dunedin, New Zealand. USA: IEEE. pp. 231−36. doi: 10.1109/ATNAC.2016.7878814
    [18] Chien WC, Cho HH, Chao HC, Shih TK. 2016. Enhanced SA-based charging algorithm for WRSN. 2016 International Wireless Communications and Mobile Computing Conference (IWCMC), 5−9 September 2016, Paphos, Cyprus. USA: IEEE. pp. 1012−17. doi: 10.1109/IWCMC.2016.7577197
    [19] Li M, Liu L, Wang Y, Peng J, Xi J, et al. 2021. Efficient wireless static chargers deployment for UAV networks. 2021 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom), 30 September 2021 − 3 October 2021, New York City, NY, USA. USA: IEEE. pp. 1483−90. doi: 10.1109/ISPA-BDCloud-SocialCom-SustainCom52081.2021.00200
    [20] Liu H, Zhong L, Liu Z, Lin F. 2022. A multi-objective genetic optimization algorithm for charger selection in static charger deployment scheme for WRSN. 2022 IEEE 14th International Conference on Advanced Infocomm Technology (ICAIT), 8-11 July 2022, Chongqing, China. USA: IEEE. pp. 230−35. doi: 10.1109/ICAIT56197.2022.9862698
    [21] Lin T L, Chang H Y, Wang Y H. 2020. A novel hybrid search and remove strategy for power balance wireless charger deployment in wireless rechargeable sensor networks. Energies 13(10):2661 doi: 10.3390/en13102661

    CrossRef   Google Scholar

    [22] You W, Ren M, Ma Y, Wu D, Yang J, et al. 2023. Practical charger placement scheme for wireless rechargeable sensor networks with obstacles. ACM Transactions on Sensor Networks 20(1):1−23 doi: 10.1145/3614431

    CrossRef   Google Scholar

    [23] Zhang Q, Fang W, Liu Q, Wu J, Xia P, et al. 2018. Distributed laser charging: A wireless power transfer approach. IEEE Internet of Things Journal 5(5):3853−64 doi: 10.1109/JIOT.2018.2851070

    CrossRef   Google Scholar

    [24] Lahmeri MA, Kishk MA, Alouini MS. 2020. Stochastic geometry-based analysis of airborne base stations with laser-powered UAVs. IEEE Communications Letters 24(1):173−77 doi: 10.1109/LCOMM.2019.2947039

    CrossRef   Google Scholar

    [25] Hartmanis J. 1982. Computers and intractability: a guide to the theory of NP-completeness (Michael R. Garey and David S. Johnson). SIAM Review 24(1):90−91 doi: 10.1137/1024022

    CrossRef   Google Scholar

    [26] Littman ML. 1994. Markov games as a framework for multi-agent reinforcement learning. In Machine Learning Proceedings 1994, eds. Cohen WW, Hirsh H. USA: Morgan Kaufmann. pp. 157−63. doi: 10.1016/B978-1-55860-335-6.50027-1
    [27] Zhang K, Liu Y, Liu J, Liu M, Başar T. 2020. Distributed learning of average belief over networks using sequential observations. Automatica 115:108857 doi: 10.1016/j.automatica.2020.108857

    CrossRef   Google Scholar

    [28] Lowe R, Wu YI, Tamar A, Harb J, Abbeel P, et al. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. Advances in Neural Information Processing Systems 30 (NIPS 2017), 4-9 Dec 2017, Long Beach, USA. https://proceedings.neurips.cc/paper_files/paper/2017
    [29] van de Velden M, D’Enza AI, Markos A. 2019. Distance-based clustering of mixed data. Wiley Interdisciplinary Reviews: Computational Statistics 11(3):e1456 doi: 10.1002/wics.1456

    CrossRef   Google Scholar

  • Cite this article

    Hu J, Luo C, Hong Y, Wang G, Fan X, et al. 2025. Deployment optimization of laser chargers in self-organizing power transfer internet of things. Wireless Power Transfer 12: e010 doi: 10.48130/wpt-0025-0004
    Hu J, Luo C, Hong Y, Wang G, Fan X, et al. 2025. Deployment optimization of laser chargers in self-organizing power transfer internet of things. Wireless Power Transfer 12: e010 doi: 10.48130/wpt-0025-0004

Figures(8)  /  Tables(4)

Article Metrics

Article views(2141) PDF downloads(471)

ARTICLE   Open Access    

Deployment optimization of laser chargers in self-organizing power transfer internet of things

Wireless Power Transfer  12 Article number: e010  (2025)  |  Cite this article

Abstract: Recently, the Internet of Things (IoT) has played an important role in many fields. Nevertheless, the fast and uneven energy consumption of IoT Devices (IoTDs) significantly limits the lifetime of IoT networks. One of the effective solutions is to deploy Laser Static Chargers (LSCs) to power IoTDs. However, deploying LSCs to cover all IoTDs will consume enormous costs. To prolong the lifetime of IoT and reduce the deployment costs of LSCs, in this paper, we first propose a novel IoT network named Self-organizing Power Transfer IoT with Laser Static Chargers (SPTIoT-LSC), where IoTDs are equipped with laser transmission and reception modules allowing energy transfer between IoTDs, and several LSCs are deployed into the network to charge IoTDs. Based on SPTIoT-LSC, we study the Minimizing Laser Chargers Coverage(MLCC) problem, which aims to minimize the number of LSCs deployed in SPTIoT-LSC while enabling all IoTDs to work continuously. Then we prove its NP-hardness. To solve the problem, we propose two sub-algorithms: the Layered Charging Scheduling Strategy (LCSS) algorithm and Deploy Chargers based on the Multi-agent deep deterministic policy gradient (DCM) algorithm to maximize the working time of IoTDs with given LSCs and corresponding positions and deploy given LSCs in SPTIoT-LSC, respectively. Based on the above sub-algorithms, we propose an approximation algorithm to solve the MLCC problem. Finally, extensive experiments are proposed to verify the efficiency of the proposed algorithm and the superiority of SPTIoT-LSC.

    • Internet of Things (IoT) is the network of physical objects, encompassing sensors, vehicles, instruments, and so on. This interconnected web enables these devices to gather and exchange data[1]. Boasting attributes like low cost, convenient deployment, and self-organization of IoT Devices (IoTDs), IoT networks have been applied in various fields, including agricultural production, smart cities, and environmental monitoring[24], etc.

      While IoTDs have clear advantages, their practical applications also face significant challenges. Specifically, IoTDs' limited battery capacity restricts their ability to operate sustainably. The rapid and uneven energy depletion of IoTDs significantly hinders the widespread practical utilization of IoT networks. To address this limitation, one of the effective solutions is to deploy Laser Static Chargers (LSCs) to power IoTDs by using laser charging, a kind of Wireless Power Transfer (WPT) technique[57]. However, deploying LSCs to cover all IoTDs to supply sufficient energy will consume enormous costs. Motivated by the above analysis, to prolong the lifetime of IoT while reducing the deployment costs of LSCs, we first propose a novel network framework called Self-organizing Power Transfer Internet of Things (SPTIoT), where each IoTD is equipped with a laser transmission module and a laser reception module, enabling laser energy transferred between IoTDs, as shown in Fig. 1. The SPTIoT network consists of several IoT devices with rechargeable batteries. All IoTDs form a self-organizing power transfer network by transmitting laser energy to each other in their neighbourhood. In the network, each IoTD can be used both as a receiver to receive energy from other IoTDs within its neighbourhood and as a charger to charge neighbour IoTDs. As a receiver, the laser reception module of the IoTD receives laser energy, converts the laser energy into electrical energy, and stores it in rechargeable batteries. As a charger, the laser transmission module of the IoTD converts its electrical energy into laser energy and transmits it to other IoTDs. Unlike general IoTs, the SPTIoT distinguishes itself by forming a self-organizing power transfer network, where IoTDs with lower energy can be charged by the IoTDs within their neighbourhood. This innovative architecture not only helps balance the energy distribution across IoTDs and extend the overall lifespan of the network but also significantly reduces the number of LSCs required, because covering a portion of IoTDs can meet the energy requirements of all IoTDs in SPTIoT.

      Figure 1. 

      Architecture of the SPTIoT network.

      Then, based on the SPTIoT, comprehensively considering the energy requirements of IoTDs and deployment costs of LSCs, we study the Minimizing Laser Chargers Coverage (MLCC) problem. The objective of the problem is to minimize the number of LSCs deployed in SPTIoT while ensuring that all IoTDs can work continuously.

      The contributions of this paper can be summarized as follows:

      (1) We propose a novel network architecture called the Self-organizing Power Transfer Internet of Things (SPTIoT), allowing energy transfer between IoTDs. We propose the concept of self-organizing wireless charging network for the first time;

      (2) We first identify the SPTIoT with LSCs (SPTIoT-LSC) network model, where LSCs are deployed in the network to replenish energy for IoTDs. Based on SPTIoT-LSC, we study the Minimizing Laser Chargers Coverage (MLCC) problem, whose objective is to minimize the number of LSCs deployed in the network while enabling all IoTDs to work continuously. Then we prove that the MLCC problem is NP-hard;

      (3) To solve the MLCC problem, we first propose a sub-problem, Charging Scheduling Problem (CSP), to maximize the working time of IoTDs with the given LSCs and their positions. Then the Layered Charging Scheduling Strategy (LCSS) algorithm is proposed to solve the CSP problem. After that, the Deploy Chargers based on Multi-agent deep deterministic policy gradient (DCM) algorithm is employed to deploy the given LSCs based on the CSP problem. Finally, we propose an approximation algorithm MLCC Algorithm (MLCCA) to solve the MLCC problem based on the above two algorithms;

      (4) Extensive simulations are presented to demonstrate the efficiency of the proposed algorithm for the MLCC problem as well as the superiority of the SPTIoT compared with general IoT.

    • This section discusses relevant research and puts forward the differences between previous literature and this paper. We classify the investigated problems into two types: wireless charging in IoT, and deployment optimization of static chargers.

    • Commonly used wireless charging methods include magnetic resonance coupling, solar charging, laser charging, etc. In the study by Xie et al.[8], they utilized a wireless charging vehicle with the magnetic resonant coupling charging method to provide periodic recharging for sensors in a wireless sensor network. However, magnetic resonant coupling technology suffers from limited charging distances, facing the problems of high deployment cost and low charging efficiency. In previous studies[911], solar charging was used to prolong the lifetime of IoTDs. However, the solar charging method is weather-dependent, and in bad weather or at night, IoTDs cannot be charged. Laser charging, in contrast, offers a reliable and stable energy supplyment, less susceptible to weather variations. The attenuation of laser beam transmission in air is very small[12], making it an effective long-distance charging method. In previous studies[13,14], laser charging technology was used to replenish energy for electric vehicles, overcoming the drawbacks including long charging times and short charging ranges. In a study by Fu et al.[7], an Unmanned Aerial Vehicle (UAV) was utilized to collect data from IoTDs and charge IoTDs using laser charging technology. By optimizing the UAV's trajectory, they maximized its residual energy while meeting all IoTDs' energy requirements. In the research of Zhang et al.[15], they used a laser-beamed WPT technology and presented a multitier tile grid-based spatial structure to charge Aerial User Equipment (AUE) in a high-altitude platform.

    • A large body of literature is dedicated to minimizing the deployment cost of static chargers in IoT networks while meeting the energy requirements of IoT devices. Liao & Jiang[16] studied the problem: minimizing the number of chargers deployed on grid points at a fixed height to make the wireless rechargeable sensor network sustainable. And then they presented two greedy algorithms to solve it. In the research of Chen & Jiang[17], a Particle Swarm Charger Deployment (PSCD) algorithm was proposed to deploy chargers by using the Particle Swarm Optimization (PSO) method. The results indicated its superiority compared with two greedy algorithms proposed by Liao & Jiang[16]. In the study by Chien et al.[18], the authors presented a layoff algorithm combining the Simulated Annealing-based (SA) method to optimize the chargers' positions in the WRSN. The proposed algorithm can reduce the number of chargers efficiently according to their simulation results. In the research of Li et al.[19], they studied how to efficiently deploy wireless static chargers in UAV networks. They presented a binary integer programming method to determine the minimum number of wireless static chargers as well as the location of each charger so that the energy requirements of UAVs are satisfied during flight. Liu et al.[20] investigated the problem of charging nodes with uncertain mobility by static chargers. They proposed a genetic-algorithm-based multi-objective optimization scheme to address the charger deployment problem. Lin et al.[21] proposed a hybrid search and removal strategy to discover the minimum number of chargers required to cover all sensor nodes. You et al.[22] studied a fundamental issue of wireless charger placement with obstacles and proposed a greedy algorithm for placing chargers. In summary, there are many articles studying the optimization of static charger deployment. However, the aforementioned studies neglected the energy transfer between devices, thereby necessitating the deployment of more chargers to ensure comprehensive coverage for IoTDs, which is more costly.

      It is clear from previous discussions that there are many studies on wireless charging in IoT and the deployment optimization of static chargers. However, they have not studied the energy transfer between IoTDs. Inspired by the above research, this paper, investigates the Minimize Laser Chargers Coverage problem in SPTIoT networks by considering laser charging technology, deployment optimization of chargers, and energy transfer between IoTDs, to minimize the number of LSCs deployed in SPTIoT and satisfy the energy requirements of all IoTDs.

    • In this section, we first introduce the network model. Then, we give the laser charging model for charging IoTDs. Finally, the formal definition of our problem is presented.

    • In this paper, we consider the network architecture of SPTIoT with Laser Static Chargers (SPTIoT-LSC), where a set of n IoTDs S = {s1, s2, ···, sn} is randomly located at a two-dimensional square detection area $ A\subseteq {\mathfrak{R}}^{2} $ for a monitoring mission and several LSCs are deployed into the area to replenish energy for IoTDs. In the network, each IoTD $ {s}_{i}\in S $ has a two-dimensional coordinate (xi, yi). All IoTDs have the uniform battery capacity EM and energy threshold ET. The IoTD stops working when its remaining energy is less than ET. Since IoTDs are in different positions and may take different tasks in the mission, each IoTD $ {s}_{i}\in S $ has its unique energy consumption rate δi.

      To satisfy the charging requirements of the network, multiple LSCs are deployed in the detection area to replenish energy for IoTDs, which can be collected in the set C = {c1, c2, ···, cm}. For simplicity, we assume that IoTDs and LSCs have the same laser transmission distance Rc. For any IoTD $ {s}_{i}\in S $, it can be charged by $ {c}_{j}\in C $ if and only if $ {d}_{i{{,}}j}^{c}\le {R}_{c} $, where $ {d}_{i{{,}}j}^{c} $ denotes the Euclidean distance between si and cj. Similarly, any IoTD $ {s}_{i}\in S $ can be charged by $ {s}_{k}\in S $ if and only if $ {d}_{i{{,}}k}^{s}\le {R}_{c} $, where $ {d}_{i{{,}}k}^{s} $ denotes the Euclidean distance between si and sk. Let $ {\mathcal{N}}_{i}^{s}=\left\{{s}_{k}\right|k\ne i{{,}}{d}_{i{{,}}k}^{s}\le {R}_{c}\} $ represent the set of neighbor IoTDs of $ {s}_{i}\in S $.

      Furthermore, let Tw represent the working period during which all IoTDs must remain continuously operational in SPTIoT-LSC. We discretize Tw evenly into $ \mathrm{\Gamma } $ time slots, with each slot having a duration of $ l=\dfrac{{T}_{w}}{\mathrm{\Gamma }} $. The set of time slots is denoted as $ \mathcal{T}=\{\mathrm{1{{,}\;}2}{{,}}\;\cdots{{,}}\;\mathrm{\Gamma }\} $. For any IoTD $ {s}_{i}\in S $, we utilize $ {E}_{i}^{r}\left(\tau \right) $ to signify its remaining energy at time slot $ \tau $, with the initial energy set as $ {E}_{i}^{r}\left(0\right)={E}_{M} $.

    • We consider the one-on-one laser charging model. We use a linear laser energy harvesting model[23,24] with efficiency ω to derive the energy transmission link from the cj to si. The power of si received from cj can be expressed as:

      $ \begin{array}{l}{P}_{c}^{j{{,}}i}=\left\{\begin{array}{l}\dfrac{\left(1-{\delta }_{s}\right){\eta }_{el}\omega {A}_{c}\chi {e}^{-\alpha {d}_{i{{,}}j}^{c}}}{(D+{d}_{i{{,}}j}^{c}{\mathrm{\Delta }}_{\mathrm{\theta }}{)}^{2}}{P}_{c}{{,}}\quad{\mathrm{d}}_{\mathrm{i}{{,}}\mathrm{j}}^{\mathrm{c}}\le {\mathrm{R}}_{\mathrm{c}}\\ 0{{,}}\quad{\mathrm{d}}_{\mathrm{i}{{,}}\mathrm{j}}^{\mathrm{c}} \gt {\mathrm{R}}_{\mathrm{c}}\end{array}\right.\end{array} $ (1)

      where, Pc is the source power of LSC, δs represents the the power splitting factor to separate the energy transfer link and communication link, $ {\eta }_{el} $ is the electricity-to-laser conversion efficiency, Ac is the area of the charging panel of each IoTD, $ \chi $ represents the optical efficiency of the combined transmission-receiver, α is the laser attenuation coefficient, $ D $ denotes the size of the initial laser beam, $ {\mathrm{\Delta }}_{\theta } $ represents the angular expansion of the laser beam.

      Similar to the laser charging model from LSC to IoTD, for any pair of $ {s}_{i}\in S $ and $ {s}_{k}\in S $, we can obtain the power of si received from sk as:

      $ \begin{array}{c}{P}_{s}^{k{{,}}i}=\left\{\begin{array}{l}\dfrac{\left(1-{\delta }_{s}\right){\eta }_{el}\omega {A}_{c}\chi {e}^{-a{d}_{i{{,}}k}^{s}}}{(D+{d}_{i{{,}}k}^{s}{\mathrm{\Delta }}_{\theta }{)}^{2}}{P}_{s}{{,}}\quad{\mathrm{d}}_{\mathrm{i}{{,}}\mathrm{k}}^{\mathrm{s}}\le {\mathrm{R}}_{\mathrm{c}}\\ 0{{,}}\quad{\mathrm{d}}_{\mathrm{i}{{,}}\mathrm{k}}^{\mathrm{s}} \gt {\mathrm{R}}_{\mathrm{c}}\end{array}\right.\end{array} $ (2)

      where, Ps is the source power of IoTD.

    • In this subsection, we investigate the Minimizing Laser Chargers Coverage (MLCC) problem, whose goal is to minimize the number of LSCs deployed into the detection area while ensuring the energy of each IoTD in S is greater than or equal to ET during the working period Tw.

      We first define the binary variables $ {a}_{i}^{k}\left(\tau \right) $ and $ {b}_{i}^{j}\left(\tau \right) $ as follows.

      $ \begin{array}{c}{a}_{i}^{k}\left(\tau \right)=\left\{\begin{array}{l}1{{,}}\quad {s}_{i}\; is\; charged\; by\;{s}_{k}\;at\; time\; slot\; \tau \\ 0{{,}}\quad otherwise\end{array}\right.\end{array} $ (3)
      $ \begin{array}{c}{b}_{i}^{j}\left(\tau \right)=\left\{\begin{array}{l}1{{,}}\quad{s}_{i}\;is\;charged\;by\;{c}_{j}\;at\;time\;slot\;\tau \\ 0{{,}}\quad otherwise\end{array}\right.\end{array} $ (4)

      Then, we give the expression for the remaining energy of $ {s}_{i}\in S $ at any time slot $ \tau > 0 $:

      $ \begin{array}{c}{E}_{i}^{r}\left(\tau \right)={E}_{i}^{r}\left(\tau -1\right)+\sum _{k=1}^{n} {a}_{i}^{k}\left(\tau \right){P}_{s}^{k{{,}}i}l+\sum _{j=1}^{m} {b}_{i}^{j}\left(\tau \right){P}_{c}^{j{{,}}i}l-{\delta }_{i}l-\sum _{v=1}^{n} {a}_{v}^{i}\left(\tau \right){P}_{s}l \end{array} $ (5)

      Finally, the MLCC problem can be mathematically formulated as:

      $ \mathbb{P} :\mathrm{min}\;\;m $ (6)

      s.t.

      $ \begin{array}{l}\text{C}{1}:\; {E}_{i}^{r}\left(\tau \right)\ge {E}_{T}{{,}}\;\forall {s}_{i}\in S{{,}}\;\forall \tau \in T\\ \text{C}{2}{:}\; {{\Sigma }}_{k=1}^{n}{a}_{i}^{k}\left(\tau \right)+{{\Sigma }}_{j=1}^{m}{b}_{i}^{j}\left(\tau \right)\le 1{t{,}} \;\forall {s}_{i}\in S,\;\forall \tau \in T\\ \text{C}{3}{:}\; {{\Sigma }}_{i=1}^{n}{b}_{i}^{j}\left(\tau \right)\le 1{{,}}\; \forall {c}_{j}\in C,\;\forall \tau \in T\\ \text{C}{4}{:}\; {{\Sigma }}_{i=1}^{n}{a}_{i}^{k}\left(\tau \right)\le 1{{,}} \;\forall {s}_{k}\in S{{,}}\;\forall \tau \in T\end{array} $ (7)

      where, the constraint C1 ensures the remaining energy of each IoTD is upper than or equal to ET at any time slot $ \tau \in \mathcal{T} $, the constraint C2 limits that one IoTD can only be charged by one IoTD or one LSC at each time slot $ \tau \in \mathcal{T} $, the constraint C3 states one LSC can only charge one IoTD at every time slot $ \tau \in \mathcal{T} $ and the constraint C4 guarantees one IoTD can only charge another one IoTD at each time slot $ \tau \in \mathcal{T} $.

      Theorem 1. The MLCC problem is NP-hard.

      Proof. We consider a special case of the MLCC (scMLCC) problem with four conditions: (1) Ps = 0; (2) Pc = +∞; (3) for each $ {s}_{i}\in S $, EM − δiTw < ET; (4) LSCs can only be deployed at the positions that coincide with IoTDs. Based on the above four conditions, the objective of the scMLCC problem is to minimize the number of LSCs, which can only be deployed at the locations coinciding with IoTDs, for coveraging all IoTDs to satisfy their charging requirements.

      The decision version of the scMLCC problem, called K-scMLCC, is that given a set of IoTDs S and a positive integer K, does there exists a positive integer m (mK) such that m LSCs deployed at the locations coinciding with m IoTDs can cover all IoTDs?

      To prove the scMLCC problem is NP-hard, we use the Minimum Dominating Set (MDS) problem for reduction, which has proven NP-hard[25]. The decision version of MDS, called J-MDS, is defined as: given an undirected graph G(V, E) and a positive integer J, does there exists a subset $ {V}^{{'}}\subseteq V $ and $ \left|{V'}\right|\le J $ satisfying that for each node $ u\in V\setminus {V'} $, there must be at least one node $ v\in {V'} $ such that the edge $ \left(u{{,}}v\right)\in E $?

      For given an instance of J-MDS, with G(V,E), V = {v1, ···, vn} and a positive integer J, we construct the instance of K-scMLCC problem as follows:

      K = J;

      ● |S| = |V|;

      ● each $ {s}_{i}\in S $ corresponds to node $ {v}_{i}\in V $ on graph G;

      ● any pair of si, $ {s}_{j}\in S $ are neighbors iff edge $ ({v}_{i}{{,}}{v}_{j})\in E $.

      In the following, we will prove that J-MDS problem has a YES answer if K-scMLCC has a YES answer.

      'Sufficiency'. Suppose m(mK) LSCs, whose locations coincide with$ m $ IoTDs, can coverage all IoTDs. Let $ {S'}\in S $ be the set of these m IoTDs and $ {V'}\subseteq V $ be the set of nodes on graph G corresponding to IoTDs in S'. Apparently, the subset $ {V'}\subseteq V $ is a dominating set in G and |V'| = |S'| = mK = J.

      'Necessity'. Suppose $ {V}^{{'}}\subseteq V $ is the dominating set in G and $ \left|{V}^{{'}}\right|\le J $. Let $ {S}^{{'}}\subseteq S $ be the set of IoTDs corresponding to nodes in $ {V}^{{'}} $ on graph $ G $. Apparently, any $ {s}_{i}\in S\setminus {S}^{{'}} $ has at least one neighbor in S'. m(m = |S'|) LSCs deployed at the locations coinciding with the IoTDs in S' can coverage all IoTDs and m = |S '| = |V '| ≤ J = K.

      Therefore, the scMLCC problem is NP-hard. Since the scMLCC problem is a special case of the MLCC problem, the MLCC problem is also NP-hard.

    • In this section, we propose an approximation algorithm called MLCC Algorithm (MLCCA) to solve the MLCC problem.

      Before introducing the MLCCA algorithm, we first study a sub-problem, the Charging Scheduling Problem (CSP), as shown in Definition 1, whose goal aims to provide a charging scheduling strategy of LSCs and IoTDs to maximize the working time of all IoTDs based on the given number and positions of LSCs. Then, we design the Layered Charging Scheduling Strategy (LCSS) algorithm to solve the sub-problem, which is a subroutine of the MLCCA.

      The MLCCA algorithm consists of two steps. Firstly, we divide the monitoring area A evenly into L × L grids. Secondly, we propose the Deploy Chargers based on Multi-agent deep deterministic policy gradient (DCM) algorithm to deploy the LSCs into the monitoring area A to charge the IoTDs by using the Multi-Agent Deep Reinforcement Learning (MADRL) method, where the output of the LCSS algorithm is utilized as one of the metrics of the reward function in the DCM algorithm.

      In the following, we first give a detailed description of the CSP problem.

      Definition 1 (CSP): Given a set S = {s1, s2, ···, sn} of n IoTDs, a set C = {c1, c2, ···, cm} of m LSCs, a set $ \mathcal{T}=\{\mathrm{1{{, }} \;2}{{, }} \;\cdots {{, }}\; \Gamma \} $ of time slots, working period Tw, the objective of Charging Scheduling Problem (CSP) is to maximize the working time Tc of IoTDs such that the remaining energy of each IoTD is greater than or equal to ET at every time slot $ \tau \in [1{{,}}\dfrac{{T}_{c}}{l}] $.

      The mathematical formulation of the CSP problem can be expressed as:

      $ \mathbb{P}:\max\;{T}_{c} $ (8)

      s.t.

      $ \begin{array}{l}\text{C}1:\; T_c\le T_w \\ \text{C}2:\; E_i^r\left(\tau\right)\ge E_T,\; \ \forall s_i\in S,\; \ \forall\tau\in[1,\; \dfrac{T_c}{l}] \\ \text{C}3:\; \Sigma_{k=1}^na_i^k\left(\tau\right)+\Sigma_{j=1}^mb_i^j\left(\tau\right)\le1,\; \ \forall s_i\in S,\; \ \forall\tau\in[1,\; \dfrac{T_c}{l}] \\ \text{C}4:\; \Sigma_{i=1}^nb_i^j\left(\tau\right)\le1t,\; \ \forall c_j\in C,\; \ \forall\tau\in[1t,\; \dfrac{T_c}{l}] \\ \text{C}5:\ \Sigma_{i=1}^na_i^k\left(\tau\right)\le1t,\; \ \forall s_k\in St,\; \ \forall\tau\in[1,\; \dfrac{T_c}{l}]\end{array} $ (9)

      where, the contsraint C1 states that Tc is less than or equal to the working periodTw, the constraint C2 ensures the remaining energy of each IoTD is greater than or equal to ET at any time slot $ \tau \in [1{{,}}\dfrac{{T}_{c}}{l}] $, the constraint C3 limits that any one IoTD can only be charged by one IoTD or one LSC at each time slot $ \tau \in [1{{,}}\dfrac{{T}_{c}}{l}] $, the constraint C4 states one LSC can only charge one IoTD at every time slot $ \tau \in [1{{,}}\dfrac{{T}_{c}}{l}] $, and the constraint C5 guarantees one IoTD can only charge another one IoTD at each time slot $ \tau \in [1{{,}}\dfrac{{T}_{c}}{l}] $.

    • In this subsection, we give a detailed description of the Layered Charging Scheduling Strategy (LCSS) algorithm, as shown in Algorithm 1.

      Table 1.  Layered charging scheduling strategy.

      Input: The number of LSCs m, the set C of m LSCs, the set S of n loTDs, time slot length l. The set $ \mathcal{T} $ of $ \Gamma $ time slots;
      Output: Tc;
      1: Initialize Tc = 0;
      2: For each $ {s}_{i}\in S $, compute $ {\mathcal{N}}_{i}^{c}=\{{c}_{j}|{d}_{i,j}^{c}\le {R}_{c}\} $;
      3: For each $ {s}_{i}\in S $, compute $ l{y}_{i} $ by using Dijkstra algorithm;
      4: for $ \tau $ from $ 1 $ to $ \mathrm{\Gamma } $ do
      5: For each $ {s}_{i}\in S $, let $ {E}_{i}^{r}(\tau )={E}_{i}^{r}(\tau -1)-{\delta }_{i}l $.
      6: Sort all loTDs and obtain $ {S}_{o}=\{{s}_{{\rho }_{1}},\cdots ,{s}_{{\rho }_{n}}\} $;
      7: for i from 1 to n do
      8: ${\rm{Let}}\; {C}_{{\rho }_{i}}=\{{c}_{j}|{c}_{j}\in {\mathcal{N}}_{{\rho }_{i}}^{c}\wedge {\mathrm{\Sigma }}_{q=1}^{n}{b}_{q}^{j}(\tau )=0\} ;$
      9: $ {{\rm{Let}}\, {S}_{{\rho }_{i}}^{u} = \{{s}_{k}|{s}_{k}\in {\mathcal{N}}_{{\rho }_{i}}^{s} \wedge l{y}_{k} = l{y}_{{\rho }_{i}} - 1\wedge {\mathrm{\Sigma }}_{q=1}^{n}{a}_{q}^{k}(\tau ) = 0 \wedge {E}_{k}^{r}(\tau ) - {P}_{s}l > {E}_{{\rho }_{i}}^{r}(\tau)\} ;}$
      10: ${\rm{Let}}\; {S}_{{\rho }_{i}}^{e}=\{{s}_{k}|{s}_{k}\in {\mathcal{N}}_{{\rho }_{i}}^{s}\wedge l{y}_{k}=l{y}_{{\rho }_{i}}\wedge {\mathrm{\Sigma }}_{q=1}^{n}{a}_{q}^{k}(\tau )=0\wedge {E}_{k}^{r}(\tau )-{P}_{s}l > {E}_{{\rho }_{i}}^{r}(\tau )\} ;$
      11: if $ {C}_{{\rho }_{i}}\ne \mathrm{\varnothing } $ then
      12: $ {c}_{v}=\mathrm{arg}min\{{d}_{{\rho }_{i},v}^{c}|{c}_{v}\in {C}_{{\rho }_{i}}\} $;
      13: $ {b}_{{\rho }_{i}}^{v}(\tau )=1 $, $ {E}_{{\rho }_{i}}^{r}(\tau )=min{\{}{E}_{M},{E}_{{\rho }_{i}}^{r}(\tau )+{P}_{c}^{v,{\rho }_{i}}l{\}} $;
      14: else if $ {S}_{{\rho }_{i}}^{u}\ne \mathrm{\varnothing } $ then
      15: $ {s}_{v}=\mathrm{arg}max\{{E}_{v}^{r}(\tau )|{s}_{v}\in {S}_{{\rho }_{i}}^{u}\} $ ;
      16: $ {E}_{{\rho }_{i}}^{r}(\tau )={E}_{{\rho }_{i}}^{r}(\tau )+{P}_{s}^{v,{\rho }_{i}}l $;
      17:
      $ {a}_{{\rho }_{i}}^{v}(\tau )=1,{E}_{v}^{r}(\tau )={E}_{v}^{r}(\tau )-{P}_{s}l $;
      18: else if $ {S}_{{\rho }_{i}}^{e}\ne \mathrm{\varnothing } $ then
      19: $ {s}_{v}=\mathrm{arg}max\{{E}_{v}^{r}(\tau )|{s}_{v}\in {S}_{{\rho }_{i}}^{e}\} $ ;
      20: $ {E}_{{\rho }_{i}}^{r}(\tau )={E}_{{\rho }_{i}}^{r}(\tau )+{P}_{s}^{v,{\rho }_{i}}l $ ;
      21: $ {a}_{{\rho }_{i}}^{v}(\tau )=1,{E}_{v}^{r}(\tau )={E}_{v}^{r}(\tau )-{P}_{s}l $;
      22: end if
      23: Let $ {S}_{o}={S}_{o}{\setminus }\{{s}_{{\rho }_{i}}\} $ and re-sort $ {S}_{o}=\{{s}_{{\rho }_{i+1}},{s}_{{\rho }_{i+2}},\cdots ,{s}_{{\rho }_{n}}\} $;
      24: end for
      25: if $ min{\{}{E}_{i}^{r}(\tau )|1\le i\le n{\}} < {E}_{T} $ then
      26: break;
      27: end if
      28: $ {T}_{c}=l\tau $;
      29: end for
      30: return Tc;

      Before describing the algorithm, we first introduce some notations. Let $ l{y}_{i} $ denote the minimum number of hops from the nearest LSC to si, which is named the layer number of si. Let $ {S}_{o}=\{{s}_{{\rho }_{1}}{{, }}\; {s}_{{\rho }_{2}}{{, }} \;\cdots {{, }} \;{s}_{{\rho }_{n}}\} $ be an ordered set of IoTDs arranged in ascending order according to their remaining energy. If more than one IoTD has the same remaining energy, then sort them in descending order by their layer numbers. If more than one IoTD has the same layer number, then sort them in ascending order by their subscript numbers.

      The LCSS algorithm consists of four steps.

      In the first step, we initialize Tc = 0.

      In the second step, for each $ {s}_{i}\in S $, we compute the set $ {\mathcal{N}}_{i}^{c}=\left\{{c}_{j}\right|{d}_{i{{,}}j}^{c}\le {R}_{c}\} $ of its neighbor LSCs.

      In the third step, we compute $ l{y}_{i} $ for each $ {s}_{i}\in S $, where the computation process is that for each $ {c}_{j}\in C $, we use the Dijkstra algorithm to compute the hops from cj to each IoTD, and then for each $ {s}_{i}\in S $, we select the smallest one as $ l{y}_{i} $ from all LSCs to si.

      In the fourth step, we schedule the charging process in the network, i.e., for any time slot $ \tau $, we update $ {E}_{i}^{r}\left(\tau \right) $ for each $ {s}_{i}\in S $ according to equation (5). At each time slot $ 1\le \tau \le \mathrm{\Gamma } $, the following substeps are executed:

      ● For each $ {s}_{i}\in S $, update the remaining energy $ {E}_{i}^{r}\left(\tau \right)={E}_{i}^{r}(\tau - 1)-{\delta }_{i}l $.

      ● We obtain $ {S}_{o}=\{{s}_{{\rho }_{1}}{{,}}\cdots {{,}}{s}_{{\rho }_{n}}\} $ by sorting all IoTDs.

      ● For each $ 1\le i\le n $, we execute the following steps to find the LSC or IoTD that can supply energy for $ {s}_{{\rho }_{i}} $ to charge $ {s}_{{\rho }_{i}} $. Firstly, we compute the set $ {C}_{{\rho }_{i}}=\left\{{c}_{j}\right|{c}_{j}\in {\mathcal{N}}_{{\rho }_{i}}^{c}\wedge {\mathrm{\Sigma }}_{q=1}^{n}{b}_{q}^{j}\left(\tau \right)=0\} $ of the LSCs that can charge $ {s}_{{\rho }_{i}} $ and the set $ {S}_{{\rho }_{i}}^{u}=\left\{{s}_{k}\right|{s}_{k}\in {\mathcal{N}}_{{\rho }_{i}}^{s}\wedge l{y}_{k}= l{y}_{{\rho }_{i}}-1\wedge {\mathrm{\Sigma }}_{q=1}^{n}{a}_{q}^{k}\left(\tau \right)=0\wedge {E}_{k}^{r}\left(\tau \right)-{P}_{s}l > {E}_{{\rho }_{i}}^{r}\left(\tau \right)\} $ of the IoTDs with layer number equal to $ l{y}_{{\rho }_{i}}-1 $ that can charge $ {s}_{{\rho }_{i}} $ and the set $ {S}_{{\rho }_{i}}^{e}= \left\{{s}_{k}\right|{s}_{k}\in{\mathcal{N}}_{{\rho }_{i}}^{s}\wedge l{y}_{k}=l{y}_{{\rho }_{i}}\wedge {\mathrm{\Sigma }}_{q=1}^{n}{a}_{q}^{k}\left(\tau \right)=0\wedge {E}_{k}^{r}\left(\tau \right)-{P}_{s}l > {E}_{{\rho }_{i}}^{r}\left(\tau \right)\} $ of the IoTDs with layer number equal to $ l{y}_{{\rho }_{i}} $ that can charge $ {s}_{{\rho }_{i}} $. Secondly, we find the LSC or IoTD to charge $ {s}_{{\rho }_{i}} $ in the light of three cases: (1) $ {C}_{{\rho }_{i}}\ne \mathrm{\varnothing } $, (2) $ {S}_{{\rho }_{i}}^{u}\ne \mathrm{\varnothing }\wedge {C}_{{\rho }_{i}}==\mathrm{\varnothing } $, and (3) $ {S}_{{\rho }_{i}}^{e}\ne \mathrm{\varnothing }\wedge {C}_{{\rho }_{i}}==\mathrm{\varnothing }\wedge {S}_{{\rho }_{i}}^{u}==\mathrm{\varnothing } $.

      (1) $ {C}_{{\rho }_{i}}\ne \mathrm{\varnothing } $.

      Let $ {c}_{v}=\mathrm{arg}min\left\{{d}_{{\rho }_{i}{{,}}v}^{c}\right|{c}_{v}\in {C}_{{\rho }_{i}}\} $. Update $ {b}_{{\rho }_{i}}^{v}\left(\tau \right)=1 $, $ {E}_{{\rho }_{i}}^{r}\left(\tau \right)=min\{{E}_{M}{{,}} {E}_{{\rho }_{i}}^{r}(\tau )+{P}_{c}^{v{{,}}{\rho }_{i}}l\} $.

      (2) $ {S}_{{\rho }_{i}}^{u}\ne \mathrm{\varnothing }\wedge {C}_{{\rho }_{i}}==\mathrm{\varnothing } $.

      Let $ {s}_{v}=\mathrm{arg}max\left\{{E}_{v}^{r}\right(\tau \left)\right|{s}_{v}\in {S}_{{\rho }_{i}}^{u}\} $. Update $ {a}_{{\rho }_{i}}^{v}\left(\tau \right)=1 $, $ {E}_{{\rho }_{i}}^{r}\left(\tau \right)= {E}_{{\rho }_{i}}^{r} \cdot \left(\tau \right)+ {P}_{s}^{v{{,}}{\rho }_{i}}l $ and $ {E}_{v}^{r}\left(\tau \right)={E}_{v}^{r}\left(\tau \right)-{P}_{s}l $.

      (3) $ {S}_{{\rho }_{i}}^{e}\ne \mathrm{\varnothing }\wedge {C}_{{\rho }_{i}}==\mathrm{\varnothing }\wedge {S}_{{\rho }_{i}}^{u}==\mathrm{\varnothing } $.

      Let $ {s}_{v}=\mathrm{arg}max\left\{{E}_{v}^{r}\right(\tau \left)\right|{s}_{v}\in {S}_{{\rho }_{i}}^{e}\} $. Update $ {a}_{{\rho }_{i}}^{v}\left(\tau \right)=1 $, $ {E}_{{\rho }_{i}}^{r}\left(\tau \right)={E}_{{\rho }_{i}}^{r}\cdot \left(\tau \right)+{P}_{s}^{v{{,}}{\rho }_{i}}l $ and $ {E}_{v}^{r}\left(\tau \right)={E}_{v}^{r}\left(\tau \right)-{P}_{s}l $.

      Thirdly, update $ {S}_{o}={S}_{o}\setminus \left\{{s}_{{\rho }_{i}}\right\} $ and re-sort $ {S}_{o}=\{{s}_{{\rho }_{i+1}}{{,}}{s}_{{\rho }_{i+2}}{{,}}\cdots {{,}}{s}_{{\rho }_{n}}\} $.

      ● If $ min\left\{{E}_{i}^{r}\right(\tau \left)\right|1\le i\le n\} < {E}_{T} $, then jump out of the loop, otherwise, let $ {T}_{c}=l\tau $.

      Finally, we can obtain the value of Tc.

      Theorem 2. Time complexity of the LCSS algorithm is $ O(\mathrm{\Gamma }{n}^{2}+m{n}^{2}) $.

      Proof. The primary time complexity of the LCSS algorithm involves the following calculations:

      Firstly, the computation of $ {\mathcal{N}}_{i}^{c} $ for each $ {s}_{i}\in S $ necessitates a running time of O(m). Consequently, the overall complexity of computing $ {\mathcal{N}}_{i}^{c} $ across all $ {s}_{i}\in S $ amounts to O(mn).

      Secondly, we repeatedly apply the Dijkstra algorithm $ m $ times to compute the minimum hops from each $ {c}_{j}\in C $ to each IoTD $ {s}_{i}\in S $, which contributes a complexity of O(mn2).

      Thirdly, we need $ \mathrm{\Gamma } $ iterations to update $ {E}_{i}^{r}\left(\tau \right) $ for each $ {s}_{i}\in S $ at any time slot $ 1\le \tau \le \mathrm{\Gamma } $. In the interior of the loop, first, we need O(nlogn) to obtain the ordered set So. Second, for any 1 ≤ i ≤ n, the computation of $ {C}_{{\rho }_{i}} $, $ {S}_{{\rho }_{i}}^{u} $ and $ {S}_{{\rho }_{i}}^{e} $ incurs a complexity of O(n(gc + 2gs)), where $ {g}_{c}=max\left\{\right|{\mathcal{N}}_{{\rho }_{i}}^{c}\left|\right|1\le i\le n\} $ and $ {g}_{s}= max\left\{\right|{\mathcal{N}}_{{\rho }_{i}}^{s}\left|\right|1\le i\le n\} $. Third, for any 1 ≤ i ≤ n, $ {s}_{{\rho }_{i}} $ is removed from So and So is re-sorted. Since only when $ {s}_{{\rho }_{i}} $ is charged by another IoTD sv, we need to re-sort So by moving sv to the correct position in So according to its remaining energy, the re-sorting process requires at most n steps and the time complexity is O(n2).

      Taken together, we can conclude that the time complexity of the LCSS algorithm is $ O(mn+mn^2+\mathrm{\Gamma}n(g_c+2g_s)+\mathrm{\Gamma}nlogn+\mathrm{\Gamma}n^2)= O(mn^2+\mathrm{\Gamma}n^2) $, since gcn and gsn.

    • Given m LSCs and L × L grids, we hope to deploy these LSCs into optimal grids of the detection area such that LSCs can serve IoTDs more effectively and IoTDs can work for a longer period. To this end, we formulate the deployment process of LSCs as a discrete-time Markov Game[26], employing the MADRL method to find a solution.

      We utilize the tuple $ (\mathcal{S}{{,}}\mathcal{A}{{,}}\mathcal{R}{{,}}\mathcal{P}{{,}}\gamma ) $ to represent the Markov Game for LSC deployment, where each LSC functions as an agent. $ \mathcal{S} $ is the global state space of the environment, containing the locations of all IoTDs and LSCs. In the initial state, m LSCs are placed in the center of the intermediate grid, which is the grid in row $ \lceil \dfrac{L}{2}\rceil $ and column $ \lceil \dfrac{L}{2}\rceil $.The set of joint actions $ \mathcal{A} $ comprises the potential actions of all agents. The reward functions of agents are captured in $ \mathcal{R} $. $ \mathcal{P} $ is the state transition probability from the current state to the next state, and $ \gamma \in \left[\mathrm{0{{,}}1}\right] $ is the discount factor. Additionally, due to the partial observability of the Markov Game[27], agents can only observe local information from the environment. We use observation space $ \mathcal{O} $ to represent the set of local information observed by agents.

      (1) Time Steps

      We set t = 1···T as the time steps of each episode in MADRL. Each episode has T time steps, and at every time step, each agent takes action to change its position by transferring to one of the adjacent grids. After T steps, the final deployment of LSCs is obtained. We set T = L to ensure that each agent can reach any grid in the area within T time steps.

      (2) Observation Space $ \mathcal{O} $

      We set $ {\mathbf{o}}^{\mathbf{t}}=\{{o}_{1}^{t}{{,}}\cdots {{,}}{o}_{m}^{t}\} $ to denote the joint observations of all agents at time step t. Since the IoTDs are placed beforehand and their positions no longer change, each agent obtains their location information in advance. Hence, the observation space $ {o}_{j}^{t} $ of j-th agent at step t is denoted as $ {o}_{j}^{t}=\left\{\right({x}_{j}^{t}{{,}}{y}_{j}^{t}){{,}}\{({x}_{i}{{,}}{y}_{i})|{s}_{i}\in S\}\} $, where $({x}_{j}^{t}{{,}}{y}_{j}^{t}) $ denotes the current coordinate of j-th agent at time step t.

      (3) Action Space $ \mathcal{A} $

      We set $ {\mathbf{a}}^{\mathbf{t}}=\{{a}_{1}^{t}{{, }}\cdots {{, }} {a}_{m}^{t}\} $ to denote the joint actions of all agents at time step t. Each agent can move to the center of one adjacent grid or keep still in any time step. Concretely, at any time step t, the action $ {a}_{j}^{t} $ of j-th agent can be expressed as a five-dimensional vector from the set {(0,0,0,0,1), (0,0,0,1,0), (0,0,1,0,0,), (0,1,0,0,0), (1,0,0,0,0)}. Each component of this vector corresponds to a specific movement direction:

      $ {a}_{j}^{t}=(\mathrm{0{{,}}0}{{,}}\mathrm{0{{,}}0}{{,}}1) $ indicates that j-th agent transfers to adjacent grid on the left.

      $ {a}_{j}^{t}=(\mathrm{0{{,}}0}{{,}}\mathrm{0{{,}}1}{{,}}0) $ represents that j-th agent moves to adjacent grid on the right.

      $ {a}_{j}^{t}=(\mathrm{0{{,}}0}{{,}}\mathrm{1{{,}}0}{{,}}0) $ signifies a transfer to the adjacent grid in front.

      $ {a}_{j}^{t}=(\mathrm{0{{,}}1}{{,}}\mathrm{0{{,}}0}{{,}}0) $ denotes that j-th agent moves to adjacent grid behind it.

      $ {a}_{j}^{t}=(\mathrm{1{{,}}0}{{,}}\mathrm{0{{,}}0}{{,}}0) $ means that j-th agent remains in its current grid.

      (4) Reward Function $ \mathcal{R} $

      At any time step t > 0, each agent takes action $ {a}_{j}^{t} $, moving to a new position. We obtain tft, which is equal to the output of Algorithm 1 when these agents are located in new positions after taking joint actions $ {\mathbf{a}}^{\mathbf{t}} $. At time step t = 0, tf0 is determined by executing Algorithm 1 when agents are in the initial positions.

      Then, the set of reward functions $ {\mathbf{r}}^{\mathbf{t}}=\{{r}_{1}^{t}{{,}}\cdots {{,}}{r}_{m}^{t}\} $ can be computed based on the value of tft. The reward function $ {r}_{j}^{t} $ of j-th agent at time step t is formulated as:

      $ \begin{array}{c}{r}_{j}^{t}=\dfrac{t{f}^{t}-t{f}^{0}-{\xi }_{j}^{t}{P}_{b}}{{T}_{w}} \end{array} $ (10)

      where, Pb is the punishment if the j-th agent moves out of the boundary of the area or collides with other agents or IoTDs, $ {\xi }_{j}^{t}\in \left\{\mathrm{0{{,}}1}\right\} $ with $ {\xi }_{j}^{t}=1 $ indicating that j-th agent accepts the punishment at time step t and $ {\xi }_{j}^{t}=0 $ otherwise.

    • In this subsection, we propose Deploy Chargers based on Multi-agent deep deterministic policy gradient (DCM) algorithm to deploy $ m $ LSCs in SPTIoT-LSC network. The Multi-Agent Deep Deterministic Policy Gradient (MADDPG)[28] algorithm is one of the MADRL methods with the paradigm of centralized training and decentralized execution. In the centralized training phase, agents use joint observations and actions of all agents to train the neural networks. In the decentralized execution phase, each agent only uses local observation to obtain its action.

      We first introduce the fundamental components and corresponding notations of MADDPG, encompassing the actor networks, critic networks, target networks, and the replay buffer. The framework of MADDPG is shown in Fig. 2. Each agent learns a $ Q $ value function serving as a critic and a policy function acting as an actor. The actor of j-th agent is built via a neural network (denoted as its parameters $ {\theta }_{j}^{\mu } $). The actor network $ {\theta }_{j}^{\mu } $ of j-th agent generates the action $ {a}_{j}^{t}={\mu }_{j}({o}_{j}^{t}{{;}}{\theta }_{j}^{\mu }) $ based on the input observation $ {o}_{j}^{t} $ at each time step t, where $ {\mu }_{j}({o}_{j}^{t}{{;}}{\theta }_{j}^{\mu }) $ is the output of the actor network. Similarly, the critic of j-th agent is also built via a neural network (denoted as its parameters $ {\theta }_{j}^{Q} $). The critic network $ {\theta }_{j}^{Q} $ of j-th agent gives the estimated $ {Q}_{j}({\mathbf{o}}^{\mathbf{t}}{{,}}{\mathbf{a}}^{\mathbf{t}}{{;}}{\theta }_{j}^{Q}) $ value based on the input joint observations $ {\mathbf{o}}^{\mathbf{t}} $ and joint actions $ {\mathbf{a}}^{\mathbf{t}} $ at any time step $ t $. Any j-th agent possesses a target actor network (represented as parameters $ {\theta }_{j}^{{\mu '}} $) and a target critic network (represented as parameters $ {\theta }_{j}^{{Q'}} $) to make the training process more stable. Let $ \mathcal{D} $ represent the replay buffer to store the transition $ ({\mathbf{o}}^{\mathbf{t}}{{,}}{\mathbf{a}}^{\mathbf{t}}{{,}}{\mathbf{r}}^{\mathbf{t}}{{,}}{\mathbf{o}}^{\mathbf{t}+1}) $. We can randomly select a mini-batch $ \mathcal{K} $ of transitions from $ \mathcal{D} $ for training the networks, effectively breaking the correlation among sequential samples and enhancing stability of the training process.

      Figure 2. 

      The framework of the MADDPG algorithm. The blue arrows represent the interaction among the agents and the environment. The black arrows are used to update the parameters of the networks.

      Then, we give the methods to update the actor networks, critic networks and target networks.

      The actor network of j-th agent is updated by using sampled policy gradient for the policy function $ {J}_{j} $:

      $ \begin{array}{c}{\nabla }_{{\theta }_{j}^{\mu }}{J}_{j}=\dfrac{1}{\left|\mathcal{K}\right|}\sum _{{k'}\in \mathcal{K}} {\nabla }_{{\theta }_{j}^{\mu }}{\mu }_{j}\left({o}_{j}^{k}{{;}}{\theta }_{j}^{\mu }\right){\nabla }_{{a}_{j}^{k}}{Q}_{j}\left({\mathbf{o}}^{\mathbf{k}}{{,}}{\mathbf{a}}^{\mathbf{k}};{\theta }_{j}^{Q}\right){|}_{{a}_{j}^{k}={\mu }_{j}\left({o}_{j}^{k}{{;}}{\theta }_{j}^{\mu }\right)} \end{array} $ (11)

      where $ {k}^{\text{'}}=({\mathbf{o}}^{\mathbf{k}}{{,}}{\mathbf{a}}^{\mathbf{k}}{{,}}{\mathbf{r}}^{\mathbf{k}}{{,}}{\mathbf{o}}^{\mathbf{k}+1}) $ is a transition in $ \mathcal{K} $.

      The critic network of j-th agent is updated by minimizing the Mean Square Error (MSE) loss:

      $ \begin{array}{c}L\left({\theta }_{j}^{Q}\right)=\dfrac{1}{\left|\mathcal{K}\right|}{\sum }_{{k'}\in \mathcal{K}}{\left[{Q}_{j}\left({\mathbf{o}}^{\mathbf{k}}{{,}}{\mathbf{a}}^{\mathbf{k}}{{;}}{\theta }_{j}^{Q}\right)-{y}_{j}^{k}\right]}^{2} \end{array} $ (12)

      where, $ {y}_{j}^{k}={r}_{j}^{k}+\gamma {Q}_{j'}({\mathbf{o}}^{\mathbf{k}+1}{{,}}{\mathbf{a}}^{\mathbf{k}+1}{{;}}{\theta }_{j}^{{Q'}}){|}_{{a}_{j}^{k+1}={\mu '}({o}_{j}^{k+1}{{;}}{\theta }_{j}^{{\mu '}})} $.

      The parameters of the target networks of j-th agent are updated through a soft update, that is:

      $ \begin{array}{c}{\theta }_{j}^{{\mu '}}\leftarrow \epsilon{\theta }_{j}^{\mu }+\left(1-\epsilon\right){\theta }_{j}^{{\mu '}}\end{array} $ (13)
      $ \begin{array}{c}{\theta }_{j}^{{Q'}}\leftarrow \epsilon{\theta }_{j}^{Q}+\left(1-\epsilon\right){\theta }_{j}^{{Q'}}\end{array} $ (14)

      where, $ \epsilon $ is a small constant.

      The details of the DCM algorithm are summarized in Algorithm 2, which consists of two phases: training phase (Lines 1−26) and executing phase (Lines 27−35).

      Table 2.  Deploy chargers based on MADDPG.

      Input: The number of episodes Ne, time steps T, a small constant $ \in $, the number of LSCs m, the set S of loTDs, the number of grids L × L;
      Output: Positions of m LSCs, Tc;
      1: for each $ {c}_{j}\in C $ do
      2: Initialize parameters of actor and target network $ {\theta }_{j}^{\mu } $ and $ {\theta }_{j}^{{\mu }'} $;
      3: Initialize parameters of actor and target network $ {\theta }_{j}^{Q} $ and $ {\theta }_{j}^{{Q}'} $;
      4: end for
      5: Initially place m agents in the center of the intermediate grid;
      6: Obtain $ t{f}^{0} $ by calling Algorithm 1;
      7: Clear out the replay buffer $ D $;
      8: for episode from 1 to Ne do
      9: Reset the positions of agents;
      10: Initialize the exploration noise $ \mathcal{N} $;
      11: for time step t from 1 to T do
      12: for agent j from 1 to m do
      13: Get the observation $ {o}_{j}^{t} $;
      14: Choose action $ {a}_{j}^{t}={\mu }_{j}({o}_{j}^{t};{\theta }_{j}^{\mu })+\mathcal{N} $;
      15: end for
      16: Execute joint actions $ {\mathit{a}}^{\mathit{t}}= \{{a}_{1}^{t},... ,{a}_{m}^{t}\} $ of all agents;
      17:
      Execute Algorithm l to obtain $ t{f}^{t} $;
      18: Obtain reward $ {\mathit{r}}^{\mathit{t}} $ and next observations $ {\mathit{o}}^{\mathit{t}+1} $;
      19: Store $ ({\mathbf{o}}^{\mathbf{t}},{\mathbf{a}}^{\mathbf{t}},{\mathbf{r}}^{\mathbf{t}},{\mathbf{o}}^{\mathbf{t}+1}) $ in replay buffer $ D $;
      20: Select random transitions $ \mathcal{K} $ from $ D $;
      21: for agent j from 1 to m do
      22: Update $ {\theta }_{j}^{\mu } $ and $ {\theta }_{j}^{Q} $ by equation (11) and (12);
      23: Update $ {\theta }_{j}^{{\mu '}} $ and $ {\theta }_{j}^{{Q'}} $ by equation (13) and (14);
      24: end for
      25: end for
      26: end for
      27: Reset the positions of agents;
      28: for time step t from 1 to T do
      29: for agent j from 1 to m do
      30: Get the observation $ {o_j^t} $;
      31: Obtain the action $ {a}_j^t={\mu }_{j}({o}_{j}^{t};{\theta }_{j}^{\mu }) $;
      32: Execute the action $ {a}_{j}^{t} $;
      33: end for
      34: end for
      35: Execute Algorithm l to obtain Tc;
      36: return positions of m agents and Tc;

      In the training phase, our target is to train the actor and critic networks of agents. The training phase contains $ {N}_{e} $ episodes and each episode has $ T $ time steps. First, we initialize the parameters of neural networks $ {\theta }_{j}^{\mu } $, $ {\theta }_{j}^{{\mu }^{\text{'}}} $, $ {\theta }_{j}^{Q} $, and $ {\theta }_{j}^{{Q}^{\text{'}}} $ of each $ {c}_{j}\in C $ and execute Algorithm 1. to acquire $ t{f}^{0} $ and the replay buffer $ \mathcal{D} $ is cleared out. Then, at each episode, we initialize positions of $ m $ LSCs and an exploration noise $ \mathcal{N} $. After that, for each time step $ t $ at any one episode, any j-th agent derives the action $ {a}_{j}^{t}={\mu }_{j}({o}_{j}^{t};{\theta }_{j}^{\mu })+\mathcal{N} $ by inputing the local observation $ {o}_{j}^{t} $ in the actor network, and then executes its action. Subsequently, agents get their rewards $ {\mathbf{r}}^{\mathbf{t}} $ and next observations $ {\mathbf{o}}^{\mathbf{t}+1} $ and store the transition $ ({\mathbf{o}}^{\mathbf{t}}{{,}}{\mathbf{a}}^{\mathbf{t}}{{,}}{\mathbf{r}}^{\mathbf{t}}{{,}}{\mathbf{o}}^{\mathbf{t}+1}) $ in the replay buffer $ \mathcal{D} $. Then, the set $ \mathcal{K} $ of random transitions are selected from $ \mathcal{D} $ to train the actor, critic, and target networks. The critic network of j-th agent is updated by minimizing the MSE loss function $ L\left({\theta }_{j}^{Q}\right) $ in Eqn (12) and the actor network of j-th agent is updated by using the policy gradient for policy function $ {J}_{j} $ in Eqn (11). The updates of the target actor network and target critic network of j-th agent are shown in Eqns (13) and (14), respectively.

      In the executing phase, we leverage the well-trained actor networks to determine the actions of agents. Any j-th agent inputs its local observation $ {o}_{j}^{t} $ in its actor network and obtains an action $ {a}_{j}^{t}={\mu }_{j}({o}_{j}^{t};{\theta }_{j}^{\mu }) $ for execution. After $ T $ steps, we obtain the final positions of LSCs and execute Algorithm 1 to obtain $ {T}_{c} $.

    • In this subsection, we propose an approximation algorithm to solve the MLCC problem, which is called MLCC Algorithm (MLCCA). The steps of the MLCCA algorithm are as follows (Algorithm 3).

      First, we divide the monitoring area A evenly into L × L grids.

      Second, for m from 1 to n, we execute Algorithm 2 to obtain the deployment result of m LSCs and Tc. If Tc == Tw then we obtain the value m, otherwise, let m = m + 1.

      Table 3.  MLCCA.

      Input: The set of IoTDs S;
      Output: m;
      1: Divide A evenly into L × L grids;
      2: for m from 1 to n do
      3: Execute Algorithm 2 to deploy m LSCs and obtain Tc;
      4: if Tc == Tw then
      5: break;
      6: else
      7: Let m = m + 1;
      8: end if
      9: end for
      10: return m;
    • In this section, we demonstrate and evaluate the performance of the proposed MLCCA algorithm and the ascendency of SPTIoT through a series of meticulously designed experiments. We first evaluate the convergence of the MADDPG approach under different learning rates and compare the performance of MADDPG with Deep Deterministic Policy Gradient (DDPG). Then, we compare the MLCCA algorithm with other baseline algorithms to demonstrate its effectiveness. Finally, we compare the performance of the SPTIoT network with the general IoT network to highlight its advantages.

      The code is implemented by using MATLAB 2019b and Python 3.6. Table 1 lists some constant parameters utilized in our experiments.

      Table 1.  Experimental parameters.

      Parameters Value Parameters Value
      A [200 m, 200 m]2 Ne 6,000
      L 20 T 20
      ET 200 J $ \left|\mathcal{D}\right| $ 25,600
      $ l $ 1 s $ \left|\mathcal{K}\right| $ 256
      $ {\delta }_{s} $ 10−5 $ \epsilon $ 0.005
      $ {\eta }_{el} $ 0.3 $ \gamma $ 0.95
      $ \omega {A}_{c}\chi $ 0.004 m2 $ \mathcal{N} $ 0.1
      $ \alpha $ 10−6 m $ {P}_{b} $ 50
      $ D $ 0.1 m Layer type Fully connected
      $ {\mathrm{\Delta }}_{\theta } $ 3.4 × 105 Optimizer Adam
      $ {\delta }_{i} $ [5 W, 20 W]
    • As the example shown in Fig. 3, we set the configurations as n = 45, Rc = 50 m, EM = 10,000 J, Tw = 3,600 s, Pc = 3,000 W, Ps = 200 W. We execute the MLCCA algorithm for the instance. Figure 3ad are the deployment results of LSCs when m = 1, 2, 3, 4, respectively. In these subfigures, the solid lines represent the movement paths of these LSCs during deployment and the dotted circles indicate the coverage areas of these LSCs. Only when m = 4, the equality Tc = 3,600 s = Tw holds. Hence, we conclude that m = 4 is the solution of MLCC problem in this instance.

      Figure 3. 

      The deployment result of LSCs for a given instance obtained by the MLCCA algorithm. The deployment result (a) when m = 1, (b) when m = 2, (c) when m = 3, (d) when m = 4.

    • In this subsection, we first evaluate the convergence performance of the MADDPG algorithm under different learning rates. Then, we compare the performance of MADDPG with DDPG.

      We set n = 40, m =3, EM = 10,000 J, Tw = 3,600 s, Rc = 50 m, Pc = 3,000 W, Ps = 200 W.

      In Fig. 4a, we illustrate the convergence performance of the MADDPG algorithm under different learning rates lr = 0.1, lr = 0.01, and lr = 0.001. As seen in the figure, a too high learning rate is unstable and has obvious fluctuation while a too low learning rate causes convergence to local optimal values. Notably, the best convergence effect is achieved with lr = 0.01 compared to rates of 0.1 and 0.001. Consequently, the learning rate is set as 0.01 in subsequent experiments.

      Figure 4. 

      The convergence performance of the MADDPG approach. (a) The convergence evaluation under different learning rates. (b) Comparison between MADDPG and DDPG.

      In Fig. 4b, we compare the convergence performance of MADDPG with DDPG, where the DDPG algorithm has the same htype parameters, observation space, action space, and reward functions. However, the difference is that the actor and critic networks of any LSC are trained only relying on its own observation and action and are not affected by the observations and actions of other LSCs. The results show that the MADDPG algorithm outperforms the DDPG algorithm, reflecting the superiority of multi-agent collaborative working.

    • In this subsection, we compare the performance of the MLCCA algorithm with four other algorithms under different settings and verify the superiority of our proposed algorithm. The baseline algorithms for comparison are listed as follows:

      (1) K-means. The K-means algorithm consists of the following steps. In the first step, initialize m = 1. In the second step, we use the K-means[29] algorithm to obtain m clustering centers by Euclidean distance clustering, and deploy m LSCs to the m clustering centers. In the third step, execute the LCSS algorithm to obtain Tc. Finally, if Tc == Tw then we obtain the value m, otherwise, let m = m + 1 and return to the second step.

      (2) Greedy. The Greedy algorithm is summarized in the following steps. In the first step, we divide the monitoring area A evenly into L × L grids. In the second step, for j from 1 to n, we deploy the j-th LSC in the optimal grid and call LCSS algorithm to obtain Tc, where the optimal grid is defined as: j-th LSC deployed in this grid can obtain larger Tc than deployed in any other grid. If Tc == Tw then we obtain the value m = j, otherwise, let j = j + 1.

      (3) Simulated Annealing (SA)[18]. We employ the Simulated Annealing[18] method to deploy LSCs, where the center of each grid is a candidate charger deployment location.

      (4) Wireless chArger placement wIth obsTacles (WAIT)[22]. We employ the WAIT[22] method (ignoring obstacles) to deploy LSCs, and the goal is to minimize the number of deployed LSCs and meet the energy demands of all IoTDs.

      In Fig. 5, we evaluate the performance of three algorithms by setting EM = 10,000 J, Tw = 3,600 s, Rc = 60 m, Pc = 3,000 W, Ps = 200 W, and changing n from 10 to 40. It can be observed that the MLCCA algorithm outperforms the other algorithms and with the increasing of n, the number of LSCs increases. The SA and WAIT algorithms perform significantly worse than the other three algorithms. This is primarily because previous studies did not consider the energy transfer between IoTDs, needing to deploy more LSCs to cover all IoTDs.

      Figure 5. 

      Performance comparison by changing the number of IoTDs n.

      In Fig. 6, we evaluate the performance of three algorithms by setting n = 30, EM = =10,000 J, Tw = 3,600 s, Pc = 3,000 W, Ps = 200 W, and changing Rc from 40 to 100 m. The figure illustrates that the MLCCA algorithm outperforms the other algorithms and with the increasing of Rc, the performance difference between the five algorithms is getting smaller. The reason is that as the Rc increases, LSCs can cover and provide energy to more IoTDs, thus narrowing the performance difference between the three algorithms. Additionally, the SA and WAIT algorithms notably lag behind the other three algorithms.

      Figure 6. 

      Performance comparison by changing the charging radius Rc.

      In Fig. 7, we compare the performance of three algorithms by setting n = 30, Tw = 3,600 s, Rc = 60 m, Pc = 3,000 W, Ps = 200 W, and changing EM from 5,000 to 20,000 J. The figure shows that the performance of MLCCA algorithm is superior compared with the other algorithms. With the increasing of EM, the curves corresponding to MLCCA, Greedy, and K-means algorithms gradually decrease, while the curves associated with SA and WAIT algorithms remain almost unchanged. This is because n and Rc remain unchanged, requiring the same number of LSCs to cover all IoTDs.

      Figure 7. 

      Performance comparison by varying the maximum battery capacity of IoTDs EM.

    • In this subsection, we undertake a comparative analysis of the SPTIoT and the general IoT network in terms of the number of LSCs required under various parameter settings. Notably, in the general IoT network, IoTDs are incapable of charging other IoTDs. The charging scheduling mechanism in the general IoT operates as follows: during each time slot $ \tau $, each $ {c}_{j}\in C $ charges the IoTD $ {s}_{i}= \mathrm{a}\mathrm{r}\mathrm{g}min\{{E}_{i}^{r}(\tau )|{d}_{i,j}^{c}\le {R}_{c}\wedge \sum _{{c}_{v}\in C\setminus\{{c}_{j}\}} {a}_{i}^{v}\left(\tau \right)=0\} $. All results are illustrated in Fig. 8.

      Figure 8. 

      Performance comparison between SPTIoT and the general IoT network by varying n, Rc, Pc, Tw. Performance comparison by varying (a) the number of IoTDs n, (b) the charging radius Rc, (c) the source power of LSCs Pc, and (d) the working period Tw.

      In Fig. 8a, we show the performance comparison of SPTIoT and general IoT network as we set EM = 10,000 J, Tw = 3,600 s, Rc = 60 m, Pc = 3,000 W, Ps = 200 W, and change n from 10 to 40. The figure shows that the SPTIoT outperforms the general IoT network, and as n increases, the number of LSCs in SPTIoT increases less than that in the general IoT network.

      In Fig. 8b, we set n = 30, EM = 10,000 J, Tw = 3,600 s, Pc = 3,000 W, Ps = 200 W, and change Rc from 40 to 100 m. The figure shows that as Rc increases, the SPTIoT has better performance, and the difference between the number of LSCs in the two networks reduces. The reason is that with the increase of Rc, LSCs can cover and provide energy to more IoTDs, thus narrowing the performance difference between the two networks.

      In Fig. 8c, we set n = 30, EM = 10,000 J, Tw = 3,600 s, Rc = 60 m, Ps = 200 W, and change from Pc from 2,000 to 8,000 W. The figure expresses that the SPTIoT maintains superior performance, and as Pc increases, the number of LSCs in SPTIoT has significantly decreased, while it remains almost unchanged in the general IoT network. This is because in the general IoT, each IoTD needs to be covered by LSCs, and increasing Pc will not change the coverage situation.

      In Fig. 8d, we set n = 30, EM = 10,000 J, Rc = 60 m, Pc = 3,000 W, Ps = 200 W, and change from Tw from 1,200 to 8,400 s. The figure illustrates that the SPTIoT has better performance. This superiority is attributed to the efficient charging scheduling mechanisms employed by SPTIoT, which enable it to effectively meet the energy demands of IoTDs over extended periods.

    • In this paper, we consider the Self-organizing Power Transfer Internet of Things (SPTIoT) network framework and investigate the Minimizing Laser Chargers Coverage (MLCC) problem, which aims at minimizing the number of LSCs deployed in SPTIoT and ensuring all IoTDs in SPTIoT work continuously. Then, we prove the MLCC problem is NP-hard. To solve the problem, we first propose a sub-problem, Charging Scheduling Problem (CSP), and corresponding Layered Charging Scheduling Strategy (LCSS) algorithm to solve it. Then we designed the Deploy Chargers based on the MADDPG (DCM) algorithm to deploy the given LSCs in the network. After that, we propose an approximation algorithm called the MLCC Algorithm (MLCCA) to solve the MLCC problem based on the LCSS algorithm and the DCM algorithm. Finally, we verify the effectiveness of the proposed algorithm as well as the superiority of SPTIoT compared to the general IoT with a large number of simulations.

      • The authors confirm contribution to the paper as follows: study conception and design: Hu J, Luo C; data collection: Hu J; analysis and interpretation of results: Hu J, Luo C, Hong Y, Li D, Fan X, Wang G; draft manuscript preparation: Hu J, Luo C. All authors reviewed the results and approved the final version of the manuscript.

      • The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

      • This work was supported in part by the National Natural Science Foundation of China (Grant Nos 62202054, 62002022) and the Young Elite Scientists Sponsorship Program by CAST (2023QNRC001).

      • The authors declare that they have no conflict of interest.

      • Copyright: © 2025 by the author(s). Published by Maximum Academic Press, Fayetteville, GA. This article is an open access article distributed under Creative Commons Attribution License (CC BY 4.0), visit https://creativecommons.org/licenses/by/4.0/.
    Figure (8)  Table (4) References (29)
  • About this article
    Cite this article
    Hu J, Luo C, Hong Y, Wang G, Fan X, et al. 2025. Deployment optimization of laser chargers in self-organizing power transfer internet of things. Wireless Power Transfer 12: e010 doi: 10.48130/wpt-0025-0004
    Hu J, Luo C, Hong Y, Wang G, Fan X, et al. 2025. Deployment optimization of laser chargers in self-organizing power transfer internet of things. Wireless Power Transfer 12: e010 doi: 10.48130/wpt-0025-0004

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return