mirror of https://github.com/go-gitea/gitea synced 2024-12-22 16:44:26 +00:00
gitea/models/repo
Bruno Sofiato 900ac62251
Allow code search by filename (#32210)
This is a large and complex PR, so let me explain its changes in detail.

First, I had to create new index mappings for Bleve and Elasticsearch as
the current ones do not support search by filename. This requires Gitea
to recreate the code search indexes (I do not know if this is a breaking
change, but I feel it deserves a heads-up).

I've used [this
approach](https://www.elastic.co/guide/en/elasticsearch/reference/7.17/analysis-pathhierarchy-tokenizer.html)
to model the filename index. It allows us to efficiently search for both
the full path and the name of a file. Bleve, however, does not support
this out of the box, so I had to write a brand-new [token
filter](https://blevesearch.com/docs/Token-Filters/) to generate the
search terms.
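
As a rough illustration of the path-hierarchy idea, here is a minimal Go sketch of how a file path could be expanded into search terms. It is not the code from this PR and does not use the Bleve API: `pathSuffixTerms` is a hypothetical helper, and emitting path suffixes (comparable to the tokenizer's reverse mode) is an assumption about how the full path and the bare file name both become searchable.

```go
package main

import (
	"fmt"
	"strings"
)

// pathSuffixTerms is a hypothetical helper (not part of Gitea or Bleve).
// It expands a slash-separated path into every suffix that starts at a
// path segment boundary, so the full path and the bare file name both
// end up as terms.
func pathSuffixTerms(path string) []string {
	parts := strings.Split(strings.Trim(path, "/"), "/")
	terms := make([]string, 0, len(parts))
	for i := range parts {
		// Skip empty segments produced by an empty path or doubled slashes.
		if parts[i] == "" {
			continue
		}
		terms = append(terms, strings.Join(parts[i:], "/"))
	}
	return terms
}

func main() {
	// Example path chosen for illustration only.
	for _, term := range pathSuffixTerms("modules/indexer/code/search.go") {
		fmt.Println(term)
	}
}
```

With this sketch, a query for `search.go` or for `code/search.go` would match one of the generated terms exactly, which is the property the filename index relies on.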

I also overhauled the `indexer_test.go` file. It now asserts the
order of the expected results (this is important since matches based on
the name of a file are more relevant than those based on its content).
I've added new test scenarios that deal with searching by filename. They
use a new repo included in the Gitea fixture.

The screenshot below depicts how Gitea shows the search results. It
shows results based on content in the same way as the current version
does. In matches based on the filename, the first seven lines of the
file contents are shown (BTW, this is how GitHub does it).
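
The seven-line preview for filename matches can be pictured with a small Go sketch. This is an assumption-laden illustration, not the rendering code from this PR: `firstLines` is a hypothetical helper, and the real implementation may handle highlighting, encodings, or long lines differently.

```go
package main

import (
	"fmt"
	"strings"
)

// firstLines is a hypothetical helper (not part of Gitea). It returns at
// most the first n lines of content, keeping their line terminators.
func firstLines(content string, n int) string {
	if n <= 0 {
		return ""
	}
	// Split into at most n+1 pieces; the last piece holds any remainder.
	lines := strings.SplitAfterN(content, "\n", n+1)
	if len(lines) > n {
		lines = lines[:n]
	}
	return strings.Join(lines, "")
}

func main() {
	// Sample content made up for illustration.
	content := "package repo\n\nimport (\n\t\"context\"\n)\n\n// comment\nfunc f() {}\n"
	fmt.Print(firstLines(content, 7))
}
```

Files shorter than seven lines are returned unchanged by this sketch.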


![image](https://github.com/user-attachments/assets/9d938d86-1a8d-4f89-8644-1921a473e858)

Resolves #32096

---------

Signed-off-by: Bruno Sofiato <bruno.sofiato@gmail.com>
2024-10-11 23:35:04 +00:00
..
archiver.go
attachment_test.go
attachment.go Azure blob storage support (#30995) 2024-05-30 07:33:50 +00:00
avatar_test.go Fix duplicate sub-path for avatars (#31365) 2024-06-15 11:43:57 +08:00
avatar.go Return an empty string when a repo has no avatar in the repo API (#31187) 2024-06-01 11:49:42 +00:00
collaboration_test.go
collaboration.go
fork_test.go
fork.go
git.go
issue.go Clean up log messages (#30313) 2024-04-07 19:17:06 +08:00
language_stats.go
license.go Support repo license (#24872) 2024-10-01 15:25:08 -04:00
main_test.go
mirror.go
pushmirror_test.go
pushmirror.go
redirect_test.go
redirect.go
release_test.go Add tag name in the commits list (#31082) 2024-08-19 17:04:06 +00:00
release.go Included tag search capabilities (#32045) 2024-09-18 02:33:11 +08:00
repo_indexer.go
repo_list_test.go Allow code search by filename (#32210) 2024-10-11 23:35:04 +00:00
repo_list.go bump to go 1.23 (#31855) 2024-09-10 02:23:07 +00:00
repo_test.go
repo_unit_test.go
repo_unit.go Refactor repo unit "disabled" check (#31389) 2024-06-18 00:51:13 +00:00
repo.go Make GetRepositoryByName more safer (#31712) 2024-07-29 01:32:54 +00:00
search.go Rename repo_model.SearchOrderByMap to repo_model.OrderByMap (#31359) 2024-06-15 06:45:02 +00:00
star_test.go
star.go
topic_test.go
topic.go
update.go
upload.go
user_repo_test.go Get repo assignees and reviewers should ignore deactivated users (#30770) 2024-04-30 08:43:08 +00:00
user_repo.go Get repo assignees and reviewers should ignore deactivated users (#30770) 2024-04-30 08:43:08 +00:00
watch_test.go
watch.go
wiki_test.go
wiki.go