You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently we're querying in the link_content, which contains the whole html of the page and includes some stuff that we're not interested at, like comments, ads etc. This affects the precision of the search.
We should be searching the query only on the cleaned text of the article. One way of doing this is using the Goose library to clean the link_content and then start making the search on that cleaned text.
The text was updated successfully, but these errors were encountered:
Currently we're querying in the link_content, which contains the whole html of the page and includes some stuff that we're not interested at, like comments, ads etc. This affects the precision of the search.
We should be searching the query only on the cleaned text of the article. One way of doing this is using the Goose library to clean the link_content and then start making the search on that cleaned text.
The text was updated successfully, but these errors were encountered: