Skip to content

dlwsdqdws/Trojan-Map

Repository files navigation

TrojanMap

Contact Information:

Name: Lei Lu

Email: [email protected]

Video Introduction

intr

TrojanMap

This project focuses on using data structures in C++ and implementing various graph algorithms to build a map application.

Trojan


The data Structure

Each point on the map is represented by the class Node shown below and defined in trojanmap.h.

class Node {
  public:
    std::string id;    // A unique id assign to each point
    double lat;        // Latitude
    double lon;        // Longitude
    std::string name;  // Name of the location. E.g. "Bank of America".
    std::vector<std::string>
        neighbors;  // List of the ids of all neighbor points.
};

Prerequisites

OpenCV Installation

For visualization, we use OpenCV library. You will use this library as a black box and don't need to worry about the graphic details.

Use the following commands to install OpenCV.

$ cd 2021Fall_TrojanMap
$ git clone https://github.com/opencv/opencv.git

Other library Installations

For Ubuntu:

$ sudo apt-get install cmake libgtk2.0-dev pkg-config
$ sudo apt install libcanberra-gtk-module libcanberra-gtk3-module
$ sudo apt-get install libncurses5-dev libncursesw5-dev
$ cp ubuntu/* ./

For MacOS:

$ brew install cmake
$ brew install ncurses

Next, type the following, but make sure that you set the path_to_install_folder to be the absolute path to the install folder under opencv.

$ cd opencv/
$ mkdir build install
$ cd build
$ cmake -D CMAKE_INSTALL_PREFIX=**path_to_install_folder**\
 -D BUILD_LIST=core,highgui,imgcodecs,imgproc,videoio\
 -D WITH_TBB=ON -D WITH_OPENMP=ON -D WITH_IPP=ON\
 -D CMAKE_BUILD_TYPE=RELEASE -D BUILD_EXAMPLES=OFF\
 -D WITH_NVCUVID=ON\
 -D WITH_CUDA=ON\
 -D BUILD_DOCS=OFF\
 -D BUILD_PERF_TESTS=OFF\
 -D BUILD_TESTS=OFF\
 -D WITH_CSTRIPES=ON\
 -D WITH_OPENCL=ON ..
$ make install

For example, if cloned this repo under "/Users/ari/github/TrojanMap", you should type:

$ cd opencv/
$ mkdir build install
$ cd build
$ cmake -D CMAKE_INSTALL_PREFIX=/Users/ari/github/TrojanMap/opencv/install\
 -D BUILD_LIST=core,highgui,imgcodecs,imgproc,videoio\
 -D WITH_TBB=ON -D WITH_OPENMP=ON -D WITH_IPP=ON\
 -D CMAKE_BUILD_TYPE=RELEASE -D BUILD_EXAMPLES=OFF\
 -D WITH_NVCUVID=ON\
 -D WITH_CUDA=ON\
 -D BUILD_DOCS=OFF\
 -D BUILD_PERF_TESTS=OFF\
 -D BUILD_TESTS=OFF\
 -D WITH_CSTRIPES=ON\
 -D WITH_OPENCL=ON ..
$ make install

Run the program

Please run:

$ bazel run src/main:main

If everything is correct, this menu will show up.

Torjan Map
**************************************************************
* Select the function you want to execute.                    
* 1. Autocomplete                                             
* 2. Find the position                                        
* 3. CalculateShortestPath                                    
* 4. Travelling salesman problem                              
* 5. Cycle Detection                                          
* 6. Topological Sort                                         
* 7. Find K Closest Points                                    
* 8. Exit                                                     
**************************************************************
Please select 1 - 8:

Function for each menu item

Step 1: Autocomplete the location name

std::vector<std::string> Autocomplete(std::string name);

We consider the names of nodes as the locations. Implement a method to type the partial name of the location and return a list of possible locations with partial name as prefix. Please treat uppercase and lower case as the same character.

Example:

Input: "Saint"
Output:

Trojan

Step 2: Find the place's Coordinates in the Map

std::pair<double, double> GetPosition(std::string name);

Given a location name, return the latitude and longitude. There are no duplicated location names. You should mark the given locations on the map. If the location does not exist, return (-1, -1).

Example:

Input: "Our Savior Chapel"
Output:

Our Savior Chapel

Our Savior Chapel

Step 3: CalculateShortestPath between two places

std::vector<std::string> CalculateShortestPath_Dijkstra(std::string &location1_name,
                                               std::string &location2_name);
std::vector<std::string> CalculateShortestPath_Bellman_Ford(std::string &location1_name,
                                               std::string &location2_name);

Given 2 locations A and B, find the best route from A to B. The distance between 2 points is the euclidean distance using latitude and longitude.

Example:

Input: "Target", "ChickfilA"
Output: [ "5237417650","6813379479","5237381975","6813379481","7204030702","4015372484","6813379430", "6813379429","544671955","21306061","6813379428","6813416122","544671962","21306060", "6813379469","6813379427","123005255","6807200376","6807200380","6813379451","6813379463", "123327639","6813379460","4141790922","4015423963","1286136447","1286136422","4015423962", "6813379494","63068643","6813379496","123241977","4015372479","4015372477","1732243576", "6813379548","4015372476","4015372474","4015372468","4015372463","6819179749", "1732243544", "6813405275","348121996","348121864","6813405280","1472141024","6813411590", "216155217","6813411589","1837212103","1837212101","6820935911","4547476733" ]

Routing

Comparing the result with the route in Apple Maps, we can find that The result in Trojan Map is very similar to the route in Apple Maps.

Routing

Both Dijkstra Algorithm and Bellman-Ford Algorithm can get the same answer, but rum time is different.

Complexity of Dijkstra with heap-optimization is O(nlogm), where n is the number of points and m is the number of edges in map.

Complexity of Bellman-Ford with queue-optimization is O(nm), where n is the number of points and m is the number of edges in map.

In this example, runtime of Dijkstra is 23521 msec, runtime of Bellman-Ford is 76848 msec.

dijkstra

bellmanford

Step 4: The Traveling Trojan Problem (AKA Traveling Salesman!)

Given a complete graph(a vector of location ids), which means each node is a neighbor of all other nodes(every location can reach all other locations in the vector).Find the shortest route that covers all the locations exactly once and goes back to the start point.

There are severao algorithms can be used. Now selecting N random points on the map a to find the path to travel these points and back to the start point and compare those algorithms.

  • Backtracking

Backtracking can always get the optimal solution since it will traverse all possible routes.

std::pair<double, std::vector<std::vector<std::string>>> TravellingTrojan(
      std::vector<std::string> &location_ids);

TSP videos

TSP

TSP

However, to get such an accurate result cost much time. The run time in this case is 507691 msec. Complexity of Backtracking is O(n!), where n is the number of points. Of course we can also get the Optimal solution by using Backtracking since it will traver all possible solutions.

2-opt is one of the simplest Heuristic Algorithm. The main idea behind it is to take a route that crosses over itself and reorder it so that it does not. The image below illustrates this issue.

TSP

This is the same as changing E,A,C,B,D,F,E into E,A,B,C,D,F,E. Visually one can intuitively associate this to a rectangle and note that A,C and B,D are like diagonals which would then be greater than the laterals A,B and C,D. This is obviously a generalization though.

std::pair<double, std::vector<std::vector<std::string>>> TravellingTrojan_2opt(
      std::vector<std::string> &location_ids);

TSP videos

TSP

TSP

It can be seen that 2-opt can not always get the optimal solution but the run time is much shorter than Backtracking. Complexity of 2-opt is O(n^2), where n is the number of locations.

Simulated Annealing(SA) is a probabilistic technique for approximating the global optimum of a given function. In TSP, the given function is the total length of traveling all locations and we want to get the minimun result. Unlike 2opt, it will not always move in the direction of local optima. If the new solution is better, then iterate to the new solution. On the contrary, iterate to a new solution with a little probability. This probability is related to the degree of excellence of the solution, and also related to the "temperature" of the SA. 2-opt with SA can make 2-opt not easy to fall into the local optimal situation.

TSP videos

TSP

TSP

There is a mature method for parameter selection of SA. Normally, if we want to get a better result, we must spend more time. When solving TSP, we can set the simulated annealing time to a time that the customer can tolerate.

In 2-opt move we remove 2 links from cyclic tour, this way obtaining 2 open segments, which we can manipulate and combine them to get a new tour. In 3-opt we remove 3 links, obtaining 3 segments to manipulate. This gives us eight combinations (including the tour identical with the initial one).

3-opt move cases are as follows:

TSP

std::pair<double, std::vector<std::vector<std::string>>> TravellingTrojan_3opt(
      std::vector<std::string> &location_ids);

TSP videos

TSP

TSP

Complexity of 3-opt is O(n^3). In general, 3-opt will be more accurate than 2-opt and cost more time than 2-opt.

Genetic algorithms(GA) are heuristic search algorithms inspired by the process that supports the evolution of life. The algorithm is designed to replicate the natural selection process to carry generation, i.e. survival of the fittest of beings. Standard genetic algorithms are divided into five phases which are:

  1. Creating initial population.
  2. Calculating fitness.
  3. Selecting the best genes.
  4. Crossing over.
  5. Mutating to introduce variations.
std::pair<double, std::vector<std::vector<std::string>>> TravellingTrojan_GA(
      std::vector<std::string> &location_ids);

TSP videos

TSP

TSP

The operating efficiency of GA largely depends on the selection of parameters. There are many papers on the selection strategy of genetic algorithm parameters, so I won’t repeat them here.

There are also many other ways like Dynamic Programming(DP), Ant Colony Algorithm(ACA) to solve TSP. There is no absolute best way. When choosing an algorithm, you must consider the specific circumstances, such as the size of the data, the time the customer can wait, etc., to combine the above algorithms. Combination and optimization of algorithms are to be completed.

Compareing of Algorithms

Now I choose some locations and run different algorithms on the same data to compare the accuracy and runtime of algorithms. The locations are {"123120189", "4011837229", "4011837224", "2514542032", "2514541020", "1931345270", "4015477529", "214470792", "63068532", "6807909279"} and the best result is gt{"123120189", "1931345270", "4011837224", "4011837229", "2514542032", "2514541020", "6807909279", "63068532", "214470792", "4015477529", "123120189"}. Length of best result is 4.6174216 mles.

The result of Backtracking is:

TSP

The result of 2-opt is:

TSP

The result of 2-opt with SA is:

TSP

The result of 3-opt is:

TSP

The result of GA is:

TSP

The comparison of path length and running time is shown in the following table:

TSP

It can be seen that, in this case, path length of 3-opt is closest to the best solution. But combining accuracy and running time, genetic algorithm is the best algorithm.

Step 5: Cycle Detection

bool CycleDetection(std::vector<double> &square);

In this section, we use a square-shaped subgraph of the original graph by using four coordinates stored in std::vector<double> square, which follows the order of left, right, upper, and lower bounds.

Then try to determine if there is a cycle path in the that subgraph. If it does, return true and report that path on the map. Otherwise return false.

Example 1:

Input: square = {-118.299, -118.264, 34.032, 34.011}
Output: true

TSP

Here we use the whole original graph as our subgraph.

TSP

Example 2:

Input: square = {-118.290919, -118.282911, 34.02235, 34.019675}
Output: false

TSP

Here we use a square area inside USC campus as our subgraph

TSP

Step 6: Topological Sort

std::vector<std::string> DeliveringTrojan(std::vector<std::string> &location_names,
                                            std::vector<std::vector<std::string>> &dependencies);

Say, Tommy Trojan got a part-time job from TrojanEats, for which he needs to pick up and deliver food from local restaurants to various location near the campus. Tommy needs to visit a few different location near the campus with certain order, since there are some constraints. For example, he must first get the food from the restaurant before arriving at the delivery point.

The Trojan Map will have some instructions about these constraints. The solution is running toposort on two files, one containing location names needs to visit, the other containing some dependencies between those locations. The compexity of O(n+m), where n is the number of location names and m is the number of dependencies. If no feasible route exists, a empty vector will be returned.

For example,

Input: 
location_names = {"Cardinal Gardens", "Coffee Bean1", "CVS"}
dependencies = {{"Cardinal Gardens","Coffee Bean1"}, {"Cardinal Gardens","CVS"}, {"Coffee Bean1","CVS"}}

Here, {"Cardinal Gardens","Coffee Bean1"} means that Tommy must go to Cardinal Gardens prior to Coffee Bean1.

Output: Cardinal Gardens -> Coffee Bean1 -> CVS

TSP

TSP

Step 7: Find K closest points

Given a location name and a integer k , find the k closest locations with name on the map and return a vector of string ids.

We will use Heap to realize it, and the complexity if O(nlogn), where n is the number of points in Trojan Map.

std::vector<std::string> FindKClosestPoints(std::string name, int k);

For example, we want to find the 8 closest points from ChickfilA.

Kclosest

Kclosestmap

Acknowledgment and References

I gratefully acknowledge the help of professor Arash Saifhashemi and alumni Haoxin Ma, Tianhong Qi and Zixin Zhang.

All references are attached to the citation in the form of hyperlinks.