](https://deep-paper.org/en/paper/2505.21943/images/cover.png)
Escaping the Trap of Point-to-Point Loss: How Point-to-Region Matching Solves Semi-Supervised Crowd Counting
Imagine looking at a photograph of a packed stadium or a bustling city square. Your task is to count every single person. In computer vision, this is the task of Crowd Counting, and it is critical for urban planning, safety monitoring, and traffic control. Deep learning has made massive strides in this field. However, there is a bottleneck: data annotation. To train a model to count people, humans currently have to manually place a dot on the head of every single person in thousands of training images. In a dense crowd, a single image might contain thousands of people. The labor cost is astronomical. ...
](https://deep-paper.org/en/paper/2412.03451/images/cover.png)
](https://deep-paper.org/en/paper/2502.07785/images/cover.png)
](https://deep-paper.org/en/paper/2403.11116/images/cover.png)
](https://deep-paper.org/en/paper/2503.20308/images/cover.png)
](https://deep-paper.org/en/paper/2412.18608/images/cover.png)
](https://deep-paper.org/en/paper/2412.15119/images/cover.png)
](https://deep-paper.org/en/paper/2503.18420/images/cover.png)
](https://deep-paper.org/en/paper/2503.20779/images/cover.png)
](https://deep-paper.org/en/paper/file-2168/images/cover.png)
](https://deep-paper.org/en/paper/2403.11295/images/cover.png)
](https://deep-paper.org/en/paper/file-2165/images/cover.png)
](https://deep-paper.org/en/paper/2412.00115/images/cover.png)
](https://deep-paper.org/en/paper/2503.19199/images/cover.png)
](https://deep-paper.org/en/paper/2407.09392/images/cover.png)
](https://deep-paper.org/en/paper/file-2161/images/cover.png)
](https://deep-paper.org/en/paper/2412.16604/images/cover.png)
](https://deep-paper.org/en/paper/2501.03841/images/cover.png)
](https://deep-paper.org/en/paper/2412.09612/images/cover.png)
](https://deep-paper.org/en/paper/2503.00361/images/cover.png)