Merged with the original to update the gem, but keep our customizations #1

grumpit · 2018-07-21T15:35:40Z

No description provided.

[ci skip]

These vars were already created at line 13 but they weren't used before.

Use a_start and b_start variables in HashDiff.lcs

A hash key with anything not matching \w (word characters) like -|, etc would not match the regex detecting arrays in the patch path and would not patch correctly.

Fix bug with array under hash key with non-word characters.

fix an error when a hash has mixed types

- Add 2.2, 2.3 and 2.4 to Travis CI.
- Modern version of `rake` fails with old RSpec so its version should be restricted.
- Fixnum is deprecated in Ruby 2.4.0 so replace it with Integer.
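The Fixnum-to-Integer replacement can be illustrated with a minimal sketch. The helper name `integer_value?` is hypothetical and does not come from the gem; it only shows that a check against `Integer` covers both small and very large integers, so no reference to `Fixnum` is needed.

```ruby
# Hypothetical helper for illustration only; not taken from the gem's source.
# Checking against Integer works for small and arbitrarily large integers alike,
# so the deprecated Fixnum constant never has to be named.
def integer_value?(value)
  value.is_a?(Integer)
end

p integer_value?(1)        # small integer
p integer_value?(2**100)   # very large integer
p integer_value?(1.0)      # a Float is not an Integer
```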

New rubies support.

Mention 2 compelling reasons to start using HashDiff

HashDiff#similar? compares values based on "node count" using HashDiff#count_nodes and HashDiff#diff (if the counts don't cancel out). If `a` and `b` are arrays/hashes, `count_nodes` recursively counts the elements; otherwise it returns 1.

If `a` and `b` are not arrays/hashes, HashDiff#similar? needlessly counts/diffs values that will ultimately end up passing through HashDiff#compare_values. A considerable performance improvement can be had by circumventing the needless recursion and calling HashDiff#compare_values directly when `a` and `b` are not arrays/hashes.

This has been benchmarked with this snippet:

```ruby
$LOAD_PATH << File.join(File.dirname(__FILE__), 'lib')
require 'hashdiff'
require 'benchmark'

seq1 = %w(a b c e h j l m n p)
seq2 = %w(a b c d e f j k l m r s t)
n = 10000

Benchmark.bm do |x|
  x.report('lcs') { n.times do ; HashDiff.lcs(seq1, seq2); end }
end
```

Before:

```sh
$ ruby benchmark.rb
       user     system      total        real
lcs  6.640000   0.010000   6.650000 (  6.649822)
```

After:

```sh
$ ruby benchmark.rb
       user     system      total        real
lcs  1.030000   0.010000   1.040000 (  1.037542)
```
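The early return described above can be sketched roughly as follows. This is not the gem's source: `container?` and `similar_sketch?` are hypothetical names, `count_nodes` is a stand-in written from the description in this commit message, and the expensive comparison is passed in as a block so the sketch stays self-contained.

```ruby
# Stand-in written from the commit description: containers are counted
# recursively, anything else counts as a single node.
def count_nodes(obj)
  case obj
  when Array then obj.sum { |element| count_nodes(element) }
  when Hash  then obj.values.sum { |value| count_nodes(value) }
  else 1
  end
end

# Hypothetical helper: true for the two container types the commit mentions.
def container?(obj)
  obj.is_a?(Array) || obj.is_a?(Hash)
end

# Hypothetical sketch of the guard: when neither value is a container,
# compare directly and never enter the expensive path (the block).
def similar_sketch?(a, b)
  return a == b unless container?(a) || container?(b)

  yield(a, b)
end

# Scalars never reach the block, so the expensive work is skipped.
p similar_sketch?(1, 1) { raise 'expensive path should not run' }
# Containers still go through whatever comparison the block performs.
p similar_sketch?([1, [2]], [3, [4]]) { |x, y| count_nodes(x) == count_nodes(y) }
```

The block stands in for the count-and-diff logic, which is left out here because its details are not given in the commit message.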

Greatly improve performance of HashDiff#similar?

add codecov to show coverage

The patches suggested in the README only work when you have string keys on the hash.

This introduces an array_path option that can be used when generating a diff. This represents the path to aspects of the diff as an array rather than a string.

eg.

```
x = {'a' => 1}
y = {'a' => 2}
HashDiff.diff(x, y)
=> [["~", "a", 1, 2]]
HashDiff.diff(x, y, :array_path => true)
=> [["~", ["a"], 1, 2]]
```

This allows there to be more flexibility with the types used as keys in a hash. Allowing workarounds for issues such as: liufengyun#25

eg

```
x = {'a'=>1}
y = {:a=>1}
HashDiff.diff(x, y)
=> [["-", "a", 1], ["+", "a", 1]]
HashDiff.diff(x, y, :array_path => true)
=> [["-", ["a"], 1], ["+", [:a], 1]]
```

And improved ability to patch hashes with keys:

eg

```
x = {a: {b: :c}}
y = {a: {b: :d}}
diff = HashDiff.diff(x, y)
=> [["~", "a.b", :c, :d]]
HashDiff.patch!(x, diff)
NoMethodError: undefined method `[]=' for nil:NilClass
diff = HashDiff.diff(x, y, array_path: true)
=> [["~", [:a, :b], :c, :d]]
HashDiff.patch!(x, diff)
=> {:a=>{:b=>:d}}
```

This updates the `patch!` and `unpatch!` methods to accept diffs with either paths as strings or as arrays.
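Why an array path keeps key types intact can be shown with a small sketch. The helper `set_at_path` is hypothetical and is not how the gem implements `patch!`; it only walks a list of keys and assigns a value at the end. Splitting a string path such as `"a.b"` yields string keys, which cannot find symbol keys in the hash.

```ruby
# Hypothetical helper for illustration; not the gem's patch! implementation.
# Walks every key except the last, then assigns the value at the last key.
def set_at_path(obj, path, value)
  *parents, last = path
  target = parents.reduce(obj) { |node, key| node[key] }
  target[last] = value
  obj
end

x = { a: { b: :c } }

# An array path carries the original symbol keys, so the lookup succeeds.
p set_at_path(x, [:a, :b], :d)

# A string path only yields string keys; x["a"] is nil, so assignment fails.
begin
  set_at_path({ a: { b: :c } }, 'a.b'.split('.'), :d)
rescue NoMethodError => e
  puts "string path failed: #{e.class}"
end
```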

s/comparisions/comparisons

Introduce an array_path option

The LCS algorithm produces excellent diffs. Unfortunately its complexity is n^2 in the number of items in an array, which can lead to extremely slow computations.

```
> require 'hashdiff';require 'benchmark'
> x = (1..100).map { |i| { key: i, foo: :bar } }
> puts Benchmark.measure { HashDiff.diff(x, x) }
  0.420000   0.000000   0.420000 (  0.430212)
```

If the size of the array is 1000 then we see it get really painful:

```
> x = (1..1000).map { |i| { key: i, foo: :bar } }
> puts Benchmark.measure { HashDiff.diff(x, x) }
 42.680000   0.590000  43.270000 ( 43.530287)
```

This commit introduces an option to sacrifice the quality of the diff for a faster computational result with the `use_lcs` option, which can be set to false to disable use of the LCS algorithm.

With `use_lcs` as false the array comparison is much simpler, with a complexity of at worst 2n for an array.

```
> x = (1..100).map { |i| { key: i, foo: :bar } }
> puts Benchmark.measure { HashDiff.diff(x, x, use_lcs: false) }
  0.010000   0.000000   0.010000 (  0.004894)
> x = (1..1000).map { |i| { key: i, foo: :bar } }
> puts Benchmark.measure { HashDiff.diff(x, x, use_lcs: false) }
  0.040000   0.000000   0.040000 (  0.042547)
```

The linear approach to comparing the array works on the basis that if the arrays are the same length it treats the array as having no additions or deletions, only changes.

```
> HashDiff.diff([0,1,2], [3,4,5], use_lcs: false)
=> [["~", "[0]", 0, 3], ["~", "[1]", 1, 4], ["~", "[2]", 2, 5]]
```

compared to:

```
> HashDiff.diff([0,1,2], [3,4,5])
=> [["-", "[2]", 2], ["-", "[1]", 1], ["-", "[0]", 0], ["+", "[0]", 3], ["+", "[1]", 4], ["+", "[2]", 5]]
```

Whereas if there are more items in one array than the other, it checks the items surrounding the index for a match to calculate additions and removals.

```
> HashDiff.diff([0, 3, 5], [0, 1, 2, 3, 4, 5], use_lcs: false)
=> [["+", "[1]", 1], ["+", "[2]", 2], ["+", "[4]", 4]]
> HashDiff.diff([0, 3, 5], [0, 1, 2, 3, 4, 5], use_lcs: false) == HashDiff.diff([0, 3, 5], [0, 1, 2, 3, 4, 5])
=> true
```

For a combination of added and changed items the diff will appear different to the LCS approach:

```
> HashDiff.diff([0, 1, 2], [0, 2, 2, 3], use_lcs: false)
=> [["~", "[1]", 1, 2], ["+", "[3]", 3]]
> HashDiff.diff([0, 1, 2], [0, 2, 2, 3])
=> [["-", "[1]", 1], ["+", "[1]", 2], ["+", "[3]", 3]]
```

However, all diffs produce the same results through the `patch!` and `unpatch!` methods.
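The equal-length case of the linear comparison can be sketched in a few lines. This is a simplified illustration, not the gem's implementation: `linear_changes` is a hypothetical name, it only handles arrays of the same length, and it does not cover the neighbour-matching logic used when lengths differ.

```ruby
# Hypothetical sketch of the equal-length case only; not the gem's source.
# Each index is visited once and reported as a change ("~") when values differ.
def linear_changes(a, b)
  raise ArgumentError, 'sketch only handles equal-length arrays' unless a.size == b.size

  a.each_index
   .reject { |i| a[i] == b[i] }
   .map { |i| ['~', "[#{i}]", a[i], b[i]] }
end

p linear_changes([0, 1, 2], [3, 4, 5])
p linear_changes([0, 1, 2], [0, 1, 2])
```

Because every index is visited a fixed number of times, the work grows linearly with the array length, in line with the linear bound described in the commit message.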

Option to allow array comparisons in linear complexity

Documentation for the :use_lcs option

…ruby-1.9.3

update minimum ruby to reflect actual support

liufengyun and others added 30 commits October 6, 2014 20:59

make library 1.8.7 compatible

548df94

update version to 0.2.2

f9cbbeb

[ci skip]

remove demo link

687de5d

Use a_start and b_start variables in HashDiff.lcs

75ed5b5

These vars were already created at line 13 but they weren't used before.

Merge pull request liufengyun#12 from keram/patch-1

6844a34

Use a_start and b_start variables in HashDiff.lcs

bumps version to 0.2.3

bfa0320

Add case insensitive option

45c572d

add :case_insensitive option to README

366d83b

bumps version to 0.3.0

26f6a71

try fix travis test

fcb2b30

Fix bug with array under hash key with non-word characters.

d07ae0a

A hash key with anything not matching \w (word characters) like -|, etc would not match the regex detecting arrays in the patch path and would not patch correctly.

don't test 1.8.7

7935759

Merge pull request liufengyun#18 from eirc/master

fff6fc2

Fix bug with array under hash key with non-word characters.

add test to :delimiter in patch/unpatch

af067d6

fix an error when a hash has mixed types

68425c1

Merge pull request liufengyun#26 from ZeroPointEnergy/fix/mixed_keys

9df2cb5

fix an error when a hash has mixed types

bumps to 0.3.1

a8f5873

New rubies support.

938f747

- Add 2.2, 2.3 and 2.4 to Travis CI.
- Modern version of `rake` fails with old RSpec so its version should be restricted.
- Fixnum is deprecated in Ruby 2.4.0 so replace it with Integer.

Merge pull request liufengyun#28 from marshall-lee/new-rubies-support

0b37a28

New rubies support.

bumps to 0.3.2

9cacb45

Mention 2 compelling reasons to start using HashDiff

1a4bf75

Merge pull request liufengyun#30 from thbar/patch-1

addd333

Mention 2 compelling reasons to start using HashDiff

Merge pull request liufengyun#31 from cloakedcode/early-return-similar

5751a2d

Greatly improve performance of HashDiff#similar?

bumps to 0.3.4

7588d7d

add codecov gem

7ada0b7

add codecov

c553aa3

Merge pull request liufengyun#33 from stephengroat/patch-1

7271736

add codecov to show coverage

Update patch documentation on README

acb2d7e

The patches suggested in the README only work when you have string keys on the hash.

kevindew and others added 15 commits August 3, 2017 22:49

Fix typo

04e4e8b

s/comparisions/comparisons

Merge pull request liufengyun#34 from kevindew/array_path

5c47aff

Introduce an array_path option

bumps to 0.3.5

83c6f4b

Merge pull request liufengyun#35 from kevindew/linear-arrays

5dea0c7

Option to allow array comparisons in linear complexity

remove code coverage

30a59a8

bumps to 0.3.6

8c40f16

Documentation for the :use_lcs option

59a92b3

Merge pull request liufengyun#36 from kevindew/linear-array-docs

a5e22bb

Documentation for the :use_lcs option

update minimum ruby to reflect actual support

34681b2

set higher retry, bump ruby versions

f153d24

attempting to work around apparently bundler problem with travis and …

70fc43e

…ruby-1.9.3

Merge pull request liufengyun#39 from lostapathy/minimum_ruby

0c00625

update minimum ruby to reflect actual support

bumps to 0.3.7

0946ded

Merge remote-tracking branch 'original/master' into merged

c0dc7b4
