Differentiate Between Major and Minor Article Changes #149

Open
opened 2019-02-10 11:16:58 -05:00 by dustinwilson · 1 comment
Owner

Many applications simply show updated articles as new and unread ones which makes sense if the article actually changes a lot and is updated with a significant amount of information. In most cases articles are updated with fixed typos, and showing them as new articles isn't helpful at all.

What we maybe could do is use a levenshtein comparison between two articles and silently update articles that don't change much. We would just need to find a happy threshold.

Some things that might be difficult would be something like images and video where if the article is mostly images and/or video. Changing an image without changing its location doesn't affect this at all, and even adding a couple of images to it doesn't change the article's markup much either, so maybe weigh changes on images and video markup more?

Many applications simply show updated articles as new and unread ones which makes sense if the article actually changes a lot and is updated with a significant amount of information. In most cases articles are updated with fixed typos, and showing them as new articles isn't helpful at all. What we maybe could do is use a levenshtein comparison between two articles and silently update articles that don't change much. We would just need to find a happy threshold. Some things that might be difficult would be something like images and video where if the article is mostly images and/or video. Changing an image without changing its location doesn't affect this at all, and even adding a couple of images to it doesn't change the article's markup much either, so maybe weigh changes on images and video markup more?
dustinwilson added the
enhancement
feature
labels 2019-02-10 11:16:58 -05:00
jking added this to the Future milestone 2019-03-04 08:16:04 -05:00
Owner

Either relying on or importing a stripped-down version of https://github.com/jfcherng/php-levenshtein-distance might be the best way to calculate edit distance.

Either relying on or importing a stripped-down version of https://github.com/jfcherng/php-levenshtein-distance might be the best way to calculate edit distance.
Sign in to join this conversation.
No milestone
No project
No assignees
2 participants
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference: MensBeam/Arsse#149
No description provided.