perspective_en - LmeSzinc/AzurLaneAutoScript GitHub Wiki

Map Detection

Video tutorials (In Chinese):

[碧蓝航线]海图识别-从屏幕内容到海域信息(上)

[碧蓝航线]海图识别-从屏幕内容到海域信息(下)

Map detection is the core of an Azur lane bot. If simply using template matching to do the enemy detection, it will inevitably appear BOSS block by enemies. AzurLaneAutoScript (Alas), provides a better approach of map detection. In module.map, you can get full information in map, such as:

2020-03-10 22:09:03.830 | INFO |    A  B  C  D  E  F  G  H
2020-03-10 22:09:03.830 | INFO | 1 -- ++ 2E -- -- -- -- --
2020-03-10 22:09:03.830 | INFO | 2 -- ++ ++ MY -- -- 2E --
2020-03-10 22:09:03.830 | INFO | 3 == -- FL -- -- -- 2E MY
2020-03-10 22:09:03.830 | INFO | 4 -- == -- -- -- -- ++ ++
2020-03-10 22:09:03.830 | INFO | 5 -- -- -- 2E -- 2E ++ ++

module.map mainly consists of the following files:

  • perspective.py Perspective detection
  • grids.py Grid data parsing
  • camera.py Camera moving
  • fleet.py Fleet moving
  • map.py Map logics for enemy searching

One Point Perspective

Before understanding how alas do map detection, we have to go through some basic knowledge of one point perspective. Map of Azur Lane is grid in one point perspective. Parsing perspective needs to calculate vanish point and distant point.

In one point perspective:

  • the perspective of horizontal lines are still horizontal lines.
  • the perspective of all vertical lines intersect at one point, called vanish point. The further a vanish point away from grids, the perspective of vertical lines closer to 90 degree.

vanish_point

  • All diagonals of the grids intersect at one point, called distant point, The further a distant point away from grids, the grid become fatter. In fact, there are 2 distant point, the following image draws the one to the left of vanish point.

distant_point

Screenshot Pre-processing

preprocess

When perspective.py gets an screenshot, function load_image do such process:

  • crop area of detection
  • to grayscale, Using the algorithm in Photoshop, (MAX(R, G, B) + MIN(R, G, B)) // 2
  • cover UI. Here use overlay.png
  • Reverse color

(Image above is before reverse, because the reversed image is too terrified to show)

Grid Detection

Detecting Grid Lines

Grid lines are black lines with a transparency of 20%. In 720P, it has 3 to 4 pixel wide. During the period of "old-UI", we simply move the image 1px and divide by the origin image to detect grid lines. White frame with transparency gradient is added in "new-UI", which increase the difficulty of detection.

Function find_peaks use scipy.signal.find_peaks to find grid lines. scipy.signal.find_peaks can find peaks of given data.

Crop image at height == 370, use following parameters:

FIND_PEAKS_PARAMETERS = {
    'height': (150, 255 - 40),
    'width': 2,
    'prominence': 10,
    'distance': 35,
}

find_peaks

As you can see, some grid lines are not detected and has many mistake as well. Not a big deal.

Scan every row and draw the image. (For better performance, image will be flatten to 1-D array before detection, which will reduce time cost to 1/4.)

peaks

We gets 4 images so far, they are vertical inner lines, horizontal inner lines, vertical edge lines, horizontal edge lines. This process takes about 0.13 s on I7-8700k , and the full map detection process will take about 0.15 s.

P. S. Parameters use to detect inner lines are different from edge lines. In different maps, we should use different parameters. If you are lazy, you can use the default parameters, which is for 7-2. Those parameters can be used in Chapter 7, can even be used in 北境序曲(event_20200227_cn) D3.

Fitting Grid Lines

Function hough_lines use cv2.HoughLines to detect lines. Now we have 4 group of lines.

hough_lines_1

Take vertical inner lines for example. There some incorrect lines.

We create a horizontal line at the middle of image, called MID_Y, (When fixing vertical lines, create a vertical one), and cross vertical inner lines, those crossing points are called mid. If the distance between two mids smaller than 3, we treat them as a group of lines, and replace them with their average. After that, we corrected the result.

Fitting Vanish Point

As mention above, all vertical lines in one point perspective intersect at one point. There are errors in vertical lines, so we can't solve the equations to get that.

Function _vanish_point_value , use to calculate the distance between a point and a group of lines, and use scipy.optimize.brute to brute-force solve the closest point to vertical lines, which is called vanish point. This surface shows the sum of distance from the point to the group of vertical line. In order to ignore wrong lines far away from vanish point, it uses logarithm.

vanish_point_distance

Still remember mid ? we re-link then to vanish point, and act as vertical lines. This is the 2nd correction.

Fitting Distant Point

We intersect the corrected vertical lines and the origin horizontal lines. distant point and canish point are on the same horizontal line, so we take a point on this horizontal line, and link all intersection, get oblique lines. Function _distant_point_value calculates the distance between the mid of oblique lines. Also use scipy.optimize.brute to brute-force solve the closet point, called distant point.

This image draws the oblique lines. Although there are many mistakes, it do gets the correct point.

diatant_point_links

Cleansing Grid Lines

With the above process, we get grid lines like this. It's generally correct, but with mistakes.

mid_cleanse_before

Take the mid of vertical lines.

[ 185.63733413  315.65944444  441.62998244  446.89313842  573.6301653
  686.40881027  701.20376316  830.27394123  959.00511191 1087.91874026
 1220.58809477]

We know all grid has a same width, so theoretically, mid is an arithmetic progression, but with wrong members and missing members. Use a linear function y = a * x + b to describe that. Because of mistakes and missing, the x in linear function may not be the number n in arithmetic progression. As long as mistakes less than 10, there will have x ∈ [n - 10, n + 10] .

Then, transform the linear function as b = -x * a + y, and x ∈ [n - 10, n + 10] . If treat a to be independent variable and treat b to be dependent variable, it's a group of lines with amount of 11 * 21. Draw them.

mid_cleanse_lines_with_circle

Discover that many lines intersect at where the orange circle pointed out, we call them coincident point. Those incorrect mid from incorrect lines can't intersect there, and get deleted.

Use scipy.optimize.brute to brute-force solve the coordinate of the best coincident point.

[-201.33197146  129.0958336]

So the linear function is y = 129.0958336 * x - 201.33197146 .

When calculating distance to the lines, it uses this function

distance = 1 / (1 + np.exp(9 / distance) / distance)

This function makes it less effect by lines far away, encourage optimizer to choose the local minimum.

mid_cleanse_function

How to cleanse horizontal lines?

Make any line through distant point, link intersections and vanish point . This finish a map relation from horizontal lines to vertical lines. When cleanse finished, do a reversed process.

mid_cleanse_convert

At last, generate mid, and crop it with the edge of map and screen. Missing mid get filled now. Re-link mid to vanish point, and the cleansing of grid lines is finished.

Draw results:

mid_cleanse_after

Grid Cropping

In fact, shipgrils, enemies, mystery are images fixed on grid center. They are scaled because of perspective.

P. S. They are scaled only, but not perspective transform. Only red border and yellow border on the ground are perspective transformed.

crop_basic

In grid_predictor.py, provides function get_relative_image , which do crops according to grid center, and rescale to given shape. Now we can simply use template matching.

from PIL import Image
from module.config.config import cfg
i = Image.open(file)
grids = Grids(i, cfg)
out = Image.new('RGB', tuple((grids.shape + 1) * 105 - 5))
for loca, grid in grids.grids.items():
    image = grid.get_relative_image(
    	(-0.415 - 0.7, -0.62 - 0.7, -0.415, -0.62), output_shape=(100, 100))
    out.paste(image, tuple(np.array(loca) * 105))
out

crop_scale

Parsing Grid Data

To be continued.