Mercurial > hg4j
annotate src/org/tmatesoft/hg/internal/DiffHelper.java @ 700:6e7786086f77 v1.2m2
Update gradle build to respect new dependency from trilead
| author | Artem Tikhomirov <tikhomirov.artem@gmail.com> | 
|---|---|
| date | Thu, 08 Aug 2013 21:04:52 +0200 | 
| parents | 58a6900f845d | 
| children | 
| rev | line source | 
|---|---|
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 1 /* | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 2 * Copyright (c) 2013 TMate Software Ltd | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 3 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 4 * This program is free software; you can redistribute it and/or modify | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 5 * it under the terms of the GNU General Public License as published by | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 6 * the Free Software Foundation; version 2 of the License. | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 7 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 8 * This program is distributed in the hope that it will be useful, | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 9 * but WITHOUT ANY WARRANTY; without even the implied warranty of | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 10 * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 11 * GNU General Public License for more details. | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 12 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 13 * For information on how to redistribute this software under | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 14 * the terms of a license other than GNU General Public License | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 15 * contact TMate Software at support@hg4j.com | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 16 */ | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 17 package org.tmatesoft.hg.internal; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 18 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 19 import java.util.ArrayList; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 20 import java.util.HashMap; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 21 import java.util.Map; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 22 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 23 /** | 
| 538 
dd4f6311af52
Commit: first working version
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
533diff
changeset | 24 * Mercurial cares about changes only up to the line level, e.g. a simple file version dump in manifest looks like (RevlogDump output): | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 25 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 26 * 522: 233748 0 103 17438 433 522 521 -1 756073cf2321df44d3ed0585f2a5754bc8a1b2f6 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 27 * <PATCH>: | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 28 * 3487..3578, 91:src/org/tmatesoft/hg/core/HgIterateDirection.java\00add61a8a665c5d8f092210767f812fe0d335ac8 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 29 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 30 * I.e. for the {fname}{revision} entry format of manifest, not only {revision} is changed, but the whole line, with unchanged {fname} is recorded | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 31 * in the patch. | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 32 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 33 * Mercurial paper describes reasons for choosing this approach to delta generation, too. | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 34 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 35 * | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 36 * @author Artem Tikhomirov | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 37 * @author TMate Software Ltd. | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 38 */ | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 39 public class DiffHelper<T extends DiffHelper.ChunkSequence<?>> { | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 40 | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 41 private Map<Object, IntVector> chunk2UseIndex; | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 42 private T seq1, seq2; | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 43 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 44 // get filled by #longestMatch, track start of common sequence in seq1 and seq2, respectively | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 45 private int matchStartS1, matchStartS2; | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 46 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 47 private MatchInspector<T> matchInspector; | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 48 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 49 public void init(T s1, T s2) { | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 50 seq1 = s1; | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 51 seq2 = s2; | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 52 prepare(s2); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 53 } | 
| 549 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 54 | 
| 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 55 public void init(T s1) { | 
| 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 56 if (seq2 == null) { | 
| 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 57 throw new IllegalStateException("Use this #init() only when target sequence shall be matched against different origin"); | 
| 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 58 } | 
| 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 59 seq1 = s1; | 
| 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 60 } | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 61 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 62 | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 63 private void prepare(T s2) { | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 64 chunk2UseIndex = new HashMap<Object, IntVector>(); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 65 for (int i = 0, len = s2.chunkCount(); i < len; i++) { | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 66 Object bc = s2.chunk(i); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 67 IntVector loc = chunk2UseIndex.get(bc); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 68 if (loc == null) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 69 chunk2UseIndex.put(bc, loc = new IntVector()); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 70 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 71 loc.add(i); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 72 // bc.registerUseIn(i) - BEWARE, use of bc here is incorrect | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 73 // in this case need to find the only ByteChain to keep indexes | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 74 // i.e. when there are few equal ByteChain instances, notion of "usedIn" shall be either shared (reference same vector) | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 75 // or kept within only one of them | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 76 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 77 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 78 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 79 public void findMatchingBlocks(MatchInspector<T> insp) { | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 80 insp.begin(seq1, seq2); | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 81 matchInspector = insp; | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 82 findMatchingBlocks(0, seq1.chunkCount(), 0, seq2.chunkCount()); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 83 insp.end(); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 84 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 85 | 
| 680 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 86 /** | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 87 * look up every line in s2 that match lines in s1 | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 88 * idea: pure additions in s2 are diff-ed against s1 again and again, to see if there are any matches | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 89 */ | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 90 void findAllMatchAlternatives(final MatchInspector<T> insp) { | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 91 assert seq1.chunkCount() > 0; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 92 final IntSliceSeq insertions = new IntSliceSeq(2); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 93 final boolean matchedAny[] = new boolean[] {false}; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 94 DeltaInspector<T> myInsp = new DeltaInspector<T>() { | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 95 @Override | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 96 protected void unchanged(int s1From, int s2From, int length) { | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 97 matchedAny[0] = true; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 98 insp.match(s1From, s2From, length); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 99 } | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 100 @Override | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 101 protected void added(int s1InsertPoint, int s2From, int s2To) { | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 102 insertions.add(s2From, s2To); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 103 } | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 104 }; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 105 matchInspector = myInsp; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 106 myInsp.begin(seq1, seq2); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 107 IntSliceSeq s2RangesToCheck = new IntSliceSeq(2, 1, 0); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 108 s2RangesToCheck.add(0, seq2.chunkCount()); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 109 do { | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 110 IntSliceSeq nextCheck = new IntSliceSeq(2); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 111 for (IntTuple t : s2RangesToCheck) { | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 112 int s2Start = t.at(0); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 113 int s2End = t.at(1); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 114 myInsp.changeStartS1 = 0; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 115 myInsp.changeStartS2 = s2Start; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 116 insp.begin(seq1, seq2); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 117 matchedAny[0] = false; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 118 findMatchingBlocks(0, seq1.chunkCount(), s2Start, s2End); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 119 insp.end(); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 120 myInsp.end(); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 121 if (matchedAny[0]) { | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 122 nextCheck.addAll(insertions); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 123 } | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 124 insertions.clear(); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 125 } | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 126 s2RangesToCheck = nextCheck; | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 127 } while (s2RangesToCheck.size() > 0); | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 128 } | 
| 
58a6900f845d
Blame: alternative strategy to handle merge revisions: map(diff(p1->base->p2)) to understand merge intentions better
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
624diff
changeset | 129 | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 130 /** | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 131 * implementation based on Python's difflib.py and SequenceMatcher | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 132 */ | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 133 public int longestMatch(int startS1, int endS1, int startS2, int endS2) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 134 matchStartS1 = matchStartS2 = 0; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 135 int maxLength = 0; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 136 IntMap<Integer> chunkIndex2MatchCount = new IntMap<Integer>(8); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 137 for (int i = startS1; i < endS1; i++) { | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 138 Object bc = seq1.chunk(i); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 139 IntVector occurencesInS2 = chunk2UseIndex.get(bc); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 140 if (occurencesInS2 == null) { | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 141 chunkIndex2MatchCount.clear(); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 142 continue; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 143 } | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 144 IntMap<Integer> newChunkIndex2MatchCount = new IntMap<Integer>(8); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 145 for (int j : occurencesInS2.toArray()) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 146 // s1[i] == s2[j] | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 147 if (j < startS2) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 148 continue; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 149 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 150 if (j >= endS2) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 151 break; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 152 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 153 int prevChunkMatches = chunkIndex2MatchCount.containsKey(j-1) ? chunkIndex2MatchCount.get(j-1) : 0; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 154 int k = prevChunkMatches + 1; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 155 newChunkIndex2MatchCount.put(j, k); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 156 if (k > maxLength) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 157 matchStartS1 = i-k+1; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 158 matchStartS2 = j-k+1; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 159 maxLength = k; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 160 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 161 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 162 chunkIndex2MatchCount = newChunkIndex2MatchCount; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 163 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 164 return maxLength; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 165 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 166 | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 167 private void findMatchingBlocks(int startS1, int endS1, int startS2, int endS2) { | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 168 int matchLength = longestMatch(startS1, endS1, startS2, endS2); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 169 if (matchLength > 0) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 170 final int saveStartS1 = matchStartS1, saveStartS2 = matchStartS2; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 171 if (startS1 < matchStartS1 && startS2 < matchStartS2) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 172 findMatchingBlocks(startS1, matchStartS1, startS2, matchStartS2); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 173 } | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 174 matchInspector.match(saveStartS1, saveStartS2, matchLength); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 175 if (saveStartS1+matchLength < endS1 && saveStartS2+matchLength < endS2) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 176 findMatchingBlocks(saveStartS1 + matchLength, endS1, saveStartS2 + matchLength, endS2); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 177 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 178 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 179 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 180 | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 181 public interface MatchInspector<T extends ChunkSequence<?>> { | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 182 void begin(T s1, T s2); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 183 void match(int startSeq1, int startSeq2, int matchLength); | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 184 void end(); | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 185 } | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 186 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 187 static class MatchDumpInspector<T extends ChunkSequence<?>> implements MatchInspector<T> { | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 188 private int matchCount; | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 189 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 190 public void begin(T s1, T s2) { | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 191 matchCount = 0; | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 192 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 193 | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 194 public void match(int startSeq1, int startSeq2, int matchLength) { | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 195 matchCount++; | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 196 System.out.printf("match #%d: from line #%d and line #%d of length %d\n", matchCount, startSeq1, startSeq2, matchLength); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 197 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 198 | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 199 public void end() { | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 200 if (matchCount == 0) { | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 201 System.out.println("NO MATCHES FOUND!"); | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 202 } | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 203 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 204 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 205 | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 206 /** | 
| 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 207 * Matcher implementation that translates "match/equal" notification to a delta-style "added/removed/changed". | 
| 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 208 */ | 
| 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 209 public static class DeltaInspector<T extends ChunkSequence<?>> implements MatchInspector<T> { | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 210 protected int changeStartS1, changeStartS2; | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 211 protected T seq1, seq2; | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 212 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 213 public void begin(T s1, T s2) { | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 214 seq1 = s1; | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 215 seq2 = s2; | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 216 changeStartS1 = changeStartS2 = 0; | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 217 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 218 | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 219 public void match(int startSeq1, int startSeq2, int matchLength) { | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 220 reportDeltaElement(startSeq1, startSeq2, matchLength); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 221 changeStartS1 = startSeq1 + matchLength; | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 222 changeStartS2 = startSeq2 + matchLength; | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 223 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 224 | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 225 public void end() { | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 226 if (changeStartS1 < seq1.chunkCount()-1 || changeStartS2 < seq2.chunkCount()-1) { | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 227 reportDeltaElement(seq1.chunkCount()-1, seq2.chunkCount()-1, 0); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 228 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 229 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 230 | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 231 protected void reportDeltaElement(int matchStartSeq1, int matchStartSeq2, int matchLength) { | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 232 if (changeStartS1 < matchStartSeq1) { | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 233 if (changeStartS2 < matchStartSeq2) { | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 234 changed(changeStartS1, matchStartSeq1, changeStartS2, matchStartSeq2); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 235 } else { | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 236 assert changeStartS2 == matchStartSeq2; | 
| 543 
1e95f48d9886
Report line index for insertion and deletion, test against 'hg diff' output
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
542diff
changeset | 237 deleted(matchStartSeq2, changeStartS1, matchStartSeq1); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 238 } | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 239 } else { | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 240 assert changeStartS1 == matchStartSeq1; | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 241 if(changeStartS2 < matchStartSeq2) { | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 242 added(changeStartS1, changeStartS2, matchStartSeq2); | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 243 } else { | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 244 assert changeStartS2 == matchStartSeq2; | 
| 549 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 245 if (matchStartSeq1 > 0 || matchStartSeq2 > 0) { | 
| 624 
507602cb4fb3
FIXMEs and TODOs: pay some technical debt
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
583diff
changeset | 246 assert false : String.format("adjustent equal blocks %d, %d and %d,%d", changeStartS1, matchStartSeq1, changeStartS2, matchStartSeq2); | 
| 549 
83afa680555d
Annotate merge revision (combined diff against two parents without looking further)
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
545diff
changeset | 247 } | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 248 } | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 249 } | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 250 if (matchLength > 0) { | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 251 unchanged(matchStartSeq1, matchStartSeq2, matchLength); | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 252 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 253 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 254 | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 255 /** | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 256 * [s1From..s1To) replaced with [s2From..s2To) | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 257 */ | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 258 protected void changed(int s1From, int s1To, int s2From, int s2To) { | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 259 // NO-OP | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 260 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 261 | 
| 543 
1e95f48d9886
Report line index for insertion and deletion, test against 'hg diff' output
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
542diff
changeset | 262 protected void deleted(int s2DeletePoint, int s1From, int s1To) { | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 263 // NO-OP | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 264 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 265 | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 266 protected void added(int s1InsertPoint, int s2From, int s2To) { | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 267 // NO-OP | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 268 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 269 | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 270 protected void unchanged(int s1From, int s2From, int length) { | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 271 // NO-OP | 
| 541 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 272 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 273 } | 
| 
946b13196252
PatchGenerator: refactoring to facilitate use in annotate/blame
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
538diff
changeset | 274 | 
| 583 
47dfa0ec7e35
Effective revlog patching
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
556diff
changeset | 275 public static class DeltaDumpInspector<T extends ChunkSequence<?>> extends DeltaInspector<T> { | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 276 | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 277 @Override | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 278 protected void changed(int s1From, int s1To, int s2From, int s2To) { | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 279 System.out.printf("changed [%d..%d) with [%d..%d)\n", s1From, s1To, s2From, s2To); | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 280 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 281 | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 282 @Override | 
| 543 
1e95f48d9886
Report line index for insertion and deletion, test against 'hg diff' output
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
542diff
changeset | 283 protected void deleted(int s2DeletionPoint, int s1From, int s1To) { | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 284 System.out.printf("deleted [%d..%d)\n", s1From, s1To); | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 285 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 286 | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 287 @Override | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 288 protected void added(int s1InsertPoint, int s2From, int s2To) { | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 289 System.out.printf("added [%d..%d) at %d\n", s2From, s2To, s1InsertPoint); | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 290 } | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 291 | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 292 @Override | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 293 protected void unchanged(int s1From, int s2From, int length) { | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 294 System.out.printf("same [%d..%d) and [%d..%d)\n", s1From, s1From + length, s2From, s2From + length); | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 295 } | 
| 542 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 296 } | 
| 
a71a05ec11bc
Towards annotate/blame support: general outline of the functionality
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
541diff
changeset | 297 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 298 /** | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 299 * Generic sequence of chunk, where chunk is anything comparable to another chunk, e.g. a string or a single char | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 300 * Sequence diff algorithm above doesn't care about sequence nature. | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 301 */ | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 302 public interface ChunkSequence<T> { | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 303 public T chunk(int index); | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 304 public int chunkCount(); | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 305 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 306 | 
| 551 
4ea0351ca878
Better (precise) name for diff facility, tests
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
549diff
changeset | 307 public static final class LineSequence implements ChunkSequence<LineSequence.ByteChain> { | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 308 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 309 private final byte[] input; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 310 private ArrayList<ByteChain> lines; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 311 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 312 public LineSequence(byte[] data) { | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 313 input = data; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 314 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 315 | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 316 public static LineSequence newlines(byte[] array) { | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 317 return new LineSequence(array).splitByNewlines(); | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 318 } | 
| 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 319 | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 320 // sequence ends with fake, empty line chunk | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 321 public LineSequence splitByNewlines() { | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 322 lines = new ArrayList<ByteChain>(); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 323 int lastStart = 0; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 324 for (int i = 0; i < input.length; i++) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 325 if (input[i] == '\n') { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 326 lines.add(new ByteChain(lastStart, i+1)); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 327 lastStart = i+1; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 328 } else if (input[i] == '\r') { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 329 if (i+1 < input.length && input[i+1] == '\n') { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 330 i++; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 331 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 332 lines.add(new ByteChain(lastStart, i+1)); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 333 lastStart = i+1; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 334 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 335 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 336 if (lastStart < input.length) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 337 lines.add(new ByteChain(lastStart, input.length)); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 338 } | 
| 538 
dd4f6311af52
Commit: first working version
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
533diff
changeset | 339 // empty chunk to keep offset of input end | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 340 lines.add(new ByteChain(input.length)); | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 341 return this; | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 342 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 343 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 344 public ByteChain chunk(int index) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 345 return lines.get(index); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 346 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 347 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 348 public int chunkCount() { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 349 return lines.size(); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 350 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 351 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 352 public byte[] data(int chunkFrom, int chunkTo) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 353 if (chunkFrom == chunkTo) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 354 return new byte[0]; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 355 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 356 int from = chunk(chunkFrom).getOffset(), to = chunk(chunkTo).getOffset(); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 357 byte[] rv = new byte[to - from]; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 358 System.arraycopy(input, from, rv, 0, rv.length); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 359 return rv; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 360 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 361 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 362 | 
| 556 
e55f17a7a195
AnnotateFacility renamed to HgBlameFacility and exposed, API shapes out and got some javadoc
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
551diff
changeset | 363 public final class ByteChain { | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 364 private final int start, end; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 365 private final int hash; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 366 | 
| 545 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 367 /** | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 368 * construct a chunk with a sole purpose to keep | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 369 * offset of the data end | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 370 */ | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 371 ByteChain(int offset) { | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 372 start = end = offset; | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 373 // ensure this chunk doesn't match trailing chunk of another sequence | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 374 hash = System.identityHashCode(this); | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 375 } | 
| 
15b406c7cd9d
First round of annotate file is functional
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
544diff
changeset | 376 | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 377 ByteChain(int s, int e) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 378 start = s; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 379 end = e; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 380 hash = calcHash(input, s, e); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 381 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 382 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 383 /** | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 384 * byte offset of the this ByteChain inside ChainSequence | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 385 */ | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 386 public int getOffset() { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 387 return start; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 388 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 389 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 390 public byte[] data() { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 391 byte[] rv = new byte[end - start]; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 392 System.arraycopy(input, start, rv, 0, rv.length); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 393 return rv; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 394 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 395 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 396 @Override | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 397 public boolean equals(Object obj) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 398 if (obj == null || obj.getClass() != ByteChain.class) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 399 return false; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 400 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 401 ByteChain other = (ByteChain) obj; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 402 if (other.hash != hash || other.end - other.start != end - start) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 403 return false; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 404 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 405 return other.match(input, start); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 406 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 407 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 408 private boolean match(byte[] oi, int from) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 409 for (int i = start, j = from; i < end; i++, j++) { | 
| 544 
7f5998a9619d
Refactor PatchGenerator to be generic and welcome sequence of any nature
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: 
543diff
changeset | 410 if (LineSequence.this.input[i] != oi[j]) { | 
| 533 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 411 return false; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 412 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 413 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 414 return true; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 415 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 416 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 417 @Override | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 418 public int hashCode() { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 419 return hash; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 420 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 421 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 422 @Override | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 423 public String toString() { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 424 return String.format("[@%d\"%s\"]", start, new String(data())); | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 425 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 426 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 427 | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 428 // same as Arrays.hashCode(byte[]), just for a slice of a bigger array | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 429 static int calcHash(byte[] data, int from, int to) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 430 int result = 1; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 431 for (int i = from; i < to; i++) { | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 432 result = 31 * result + data[i]; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 433 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 434 return result; | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 435 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 436 } | 
| 
e6f72c9829a6
Generate patches using diff algorithm
 Artem Tikhomirov <tikhomirov.artem@gmail.com> parents: diff
changeset | 437 } | 
