http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/opennlp-similarity/src/test/resources/style_recognizer/txt/Wind/139WindRiadhEt_ContrarotatingConverter_EN.txt.txt
----------------------------------------------------------------------
diff --git 
a/opennlp-similarity/src/test/resources/style_recognizer/txt/Wind/139WindRiadhEt_ContrarotatingConverter_EN.txt.txt
 
b/opennlp-similarity/src/test/resources/style_recognizer/txt/Wind/139WindRiadhEt_ContrarotatingConverter_EN.txt.txt
new file mode 100644
index 0000000..a073ed1
--- /dev/null
+++ 
b/opennlp-similarity/src/test/resources/style_recognizer/txt/Wind/139WindRiadhEt_ContrarotatingConverter_EN.txt.txt
@@ -0,0 +1,2 @@
+
+ Performance of a Contrarotating Small Wind Energy Converter 1. 
Introduction Wind energy has been shown to be one of the most feasible sources 
of renewable energy . It presents attractive opportunities to a wide range of 
people , including investors and entrepreneurs . The main goal of wind energy 
industry is to minimize the cost of wind energy in order to make it more 
competitive compared to other energy sources . How to reduce the cost of wind 
energy is a vital engineering challenge presented by the interlocking 
disciplines of aerodynamics , structure , control , electrical conversion , and 
electronics . In fact , technologies in these related areas are still under 
active research and development to achieve high efficiency and low cost . In 
the shadows of advancing multimegawatt wind turbines is another growing sector 
within this industry , the small wind turbines . Small wind energy converters ( 
SWECs ) for urban or rural applications range in size from a few hundred watts 
to 
 thousands of watts ( usually with a rated capacity of less than 100 kW ) and 
can be applied economically for a variety of power demands . These systems can 
be used in connection with an electricity transmission and distribution system 
, or in stand-alone applications that are not connected to the utility grid and 
are appropriate for homes , farms , or even entire communities . Investments in 
this sector are feasible not as stand-alone only , but as components of an 
integrated power-generating system that include various forms of energy 
resources . The main technical challenges for SWECs are the design of a system 
that has maximum efficiency in turbulent low speed winds ; ability to comply 
with both efficiency expectations and the requirements of grid utilities ; and 
have the minimum environmental and health impacts in terms of noise and 
vibration . Two facts characterize the urban environment for wind energy : 
lower annual mean wind speed ( AMWS ) compared to rural areas or to sea s
 hores and more turbulent flow . The low AMWS is related to the uneven ground 
created by buildings and other features of the urban landscape , which causes 
wind speeds to increase with height above the ground more slowly . The 
turbulent flow is a result of the wind interacting with landscape obstacles , 
the fact that applies extra stress on the turbine blades . The challenge is to 
develop wind turbines that operate at lower speeds and cope with the turbulent 
. The wind generating technology development is leading to improved performance 
and efficiency . Most wind turbines are single-rotor systems , which provide 
simplicity , reliability , and durability . Along the years , improvements have 
enhanced energy conversion efficiency of these single-rotor systems . For 
example , blades have better aerodynamic characteristics , gears with reduced 
noise have better torque transmission efficiency , and alternators have better 
electrical efficiency . However , despite these improvements , sing
 le-rotor systems are able to convert only a small fraction of the total wind 
stream energy into electrical energy . Moreover , such a system requires high 
wind velocity ( above 4 m/s ) which is not available in many places , a part 
from costal regions . This low velocity and seasonal winds imply a high cost of 
exploitation of wind energy . Thus , the challenge lies with the design of a 
wind generator which can operate at lower speeds and be used in a small-scale 
manner in remote and rural areas . This paper investigates the performance of 
the SWEC basing on wind tunnel tests . The paper is organized as follows . 
Section 2 discusses the contrarotating concepts and provides a literature 
review on the subject , while Section 3 presents the theory of rotor torque and 
power . Section 4 describes the wind tunnel experimental setup , and Section 5 
presents the rotor performance results . Finally , conclusions are drawn in 
Section 6. 2. Contrarotating Blade System The prime mover in wind en
 ergy system is the wind turbine . One prevailing trend in wind turbine 
technology throughout the past couple of decades has been growth in the size of 
the rotor to realize the advantages of scale and the generally higher winds 
available at greater heights . Geometrically , consistent upscaling of blade 
length shows that the surface stresses at the blade surface , vibratory loads , 
and loading noise due to aerodynamical and gravitational loads grow in 
proportion to the length of the blade [ 1 ] . However , an alternative mean of 
overcoming the limitation of the efficiency of a single-rotor system without 
increasing the size of the rotor and consequently the stress on blades could be 
through the adoption of a dual-rotor ( contrarotating ) blade system . In 
addition , the acceptance of wind turbines by the public depends strongly on 
achieving low noise levels in operation , which largely depends on the level of 
stress on the blades . According to Betz theory , the maximum power that ca
 n be extracted from the wind is about 59 % of the available energy in the wind 
when the axial wind speed is reduced by two-thirds across a single rotor disc . 
However , practical wind turbines convert less than 40 % of the wind energy 
into electrical energy . Hence , nearly 60 % of the potential wind energy 
escapes without being harnessed . In reality , the energy in the wake behind a 
single rotor is not very small . Part of this energy may be extracted further 
by installing a second rotor in the wake . As the wake behind the first rotor 
is rotating in the opposite direction to the rotational direction of the rotor 
, the second rotor should rotate in the same direction as the wake in order to 
extract efficiently the available energy in the wake . The contrarotating 
system is a very old concept that was initially proposed more than 100 years 
ago . A friend of Betz who is sometimes described as the “ father of modern 
wind energy collection theory , ” Hans Honneff , wrote a book on
  the use of contrarotation , using two rotors one behind the other , driving 
the two halves of an electrical generator , therefore creating a true wind 
turbine [ 2 ] . Currently , the contra concept is used on airplanes , boats , 
and submarines to increase efficiency while eliminating the asymmetrical torque 
faced by conventional rotors . A dual-rotor system can be described as a system 
consisting of two rotors separated by an appropriate distance ( Figure 1 ) . 
One of the rotors is rotating in counterclockwise direction and the other in 
clockwise direction on the same axis . The relative size as well as the 
appropriate distance between the two rotors should be identified for best 
performance . Drawbacks of the dual-rotor system come from mechanical 
complexity based on the fact that in order to reverse direction of rotation of 
one rotor , a gearbox is required . This may increase weight or maintenance and 
spare parts cost for the system . Based on evidence in literature , aerodynami
 c research is poised between experimental and computational : either the wind 
turbine is studied experimentally in a wind tunnel , or the turbine is 
investigated computationally using methods that belong to the field of 
computational fluid dynamics ( CFD ) . The two are closely linked , and as 
progress is made in the development of more advanced computational fluid models 
, more comprehensive wind tunnel experimental data is required to validate the 
models . Experimental and computational research provide results for better 
understanding of the flow physics and enable investigation of wind energy 
performance , a requirement in order to adjust the design of wind turbines to 
the unique aerodynamic conditions in the environment . As with all methods of 
analysis , the CFD approach has limitations which are essentially related to 
turbulence modeling . Sumner et al . [ 3 ] review the development of CFD as a 
virtual , multiscale wind tunnel applied by the wind energy community from 
small t
 o large scale . Although the cost of a CFD analysis may be comparable to that 
of a wind tunnel experiment , we considered the wind tunnel experimental option 
for the current study emphasizing on the importance of transition to turbulence 
effects . Typically , wind tunnel tests overstate performance , and consumers 
will never see the performance measured in a wind tunnel . However , such tests 
are good indicators of performance . To our knowledge , only a limited number 
of wind tunnel studies can be found in literature [ 2 , 4 ] . In order to study 
the streamlines and obtain the detailed information of flow around the wind 
turbine , a flow visualization and velocity measurement are important . 
Investigation [ 5 ] has been carried out for this sake . Considerable 
improvements in the understanding of contrarotating wind turbine system can be 
achieved through proper instrumentation and experimental measurements . 
According to [ 6 ] , the maximum power that can be extracted from a dual-r
 otor system increases up to 64 % of the available energy . It continues to 
reach 66.7 % for an infinite number of rotors [ 7 ] . A contrarotating wind 
turbine equipped with two 500 kW turbines performed quite well at high wind 
speeds . The turbine can produce 43.5 % more annual energy than a single-rotor 
turbine of the same type . The performance of the system can be improved if it 
is operated for low wind speeds at the tip-speed ratio where a maximum Cp is 
obtained [ 8 ] . Research studies provide sufficient evidence to look closer at 
the concept of contrarotating system to eventually produce quantifiable 
comparisons to other turbines [ 9 , 10 ] . A smaller gear ratio is needed 
because of higher tip speeds achieved by smaller blade length in comparison 
with the conventional system in case of the same power output . Energy capture 
in the rotor holds the greatest potential for long-term reduction of the cost 
of wind energy . A feasibility study [ 11 ] provides sufficient evidence to 
 look closer at the concept of contrarotating to eventually produce 
quantifiable comparisons to other turbines . Their field tests showed that a 
dual-rotor turbine produces 43.5 % more annual energy than a single-rotor 
turbine of the same type . In addition , a smaller gear ratio is needed because 
of higher tip speeds achieved by smaller blade length in comparison with the 
conventional system in case of the same power output [ 12 ] . According to a 
field test demonstrated in California [ 13 ] , energy extraction from a wind 
turbine using contrarotating system increased by up to 40 % over an equivalent 
wind turbine with only one rotor . Power conversion efficiency was high at low 
rotor speeds , suggesting applicability of contrarotating turbines to large 
utility-scale wind turbines that rotate at 16–20 rpm . In addition , the 
bending stress on the supporting tower was reduced by the contrarotating system 
over the single-rotor system . This reduced bending stress results when the tor
 ques produced by two rotors counterbalance each other . The contrarotating 
SWEC clearly has a promise for wind energy , and after preliminary research and 
field studies [ 6–13 ] , it was decided to proceed with a small SWEC 
prototype for testing and evaluation . 3. Rotor Torque and Power The motion of 
any fluid can be derived from the basic physical principles of mass , momentum 
, and energy interchange . The torque responsible for power production of the 
wind turbine mostly arises due to the forces produced by interaction of blades 
with the wind . The output power 𝑃 𝑇 from a turbine rotor and the wind 
kinetic energy per unit time 𝑃 𝑊 are given as follows : 𝑃 𝑇 = 𝑇 
𝑚 𝑃 × 𝜔 , 𝑊 = 1 2 𝜌 × 𝑉 3 0 × 𝐴 , ( 1 ) where 𝑇 𝑚 
is the mechanical torque at the turbine side , 𝜔 is the angular rotation of 
the shaft , 𝜌 is the air density at the hub height , 𝑉 0 is the wind 
velocity , and 𝐴 is the swept area of the blades . If momentum 
 equation is solved across an idealized control volume about the turbine rotor 
, it can be shown that the percentage of the total power available that can be 
extracted by a turbine is 16/27 or 59 % . This limit is known as the Betz limit 
. Therefore , the maximum power that a turbine can produce is expressed as 
follows [ 14 ] : 𝑃 𝑊 =  1 6 1 2 7   2  𝜌 × 𝑉 3 0 × 
𝐴 . ( 2 ) Most turbines extract the maximum possible energy as defined above 
for lower wind speeds but gradually become less efficient as the on-coming wind 
speed increases and the flow condition across the blades approaches the stall 
condition . The rotor power coefficient 𝐶 𝑝 is defined as the ratio 
between the rotor output power and the dynamic power of the air as shown in the 
following : 𝐶 𝑝 = 𝑃 𝑇 𝑃 𝑊 = 𝑇 𝑚 × 𝜔  ( 1 / 2 ) 
𝜌 × 𝑉 3 0  × 𝐴 . ( 3 ) The power coefficient is a nonlinear 
function of the tip speed-ratio 𝜆 , which depends on the wind
  velocity and the rotation speed of the shaft 𝑉 𝜆 = T i p 𝑉 0 = 𝑟 
× 𝜔 𝑉 0 , ( 4 ) where 𝑟 is the rotor radius . The rotor power 
coefficient is regarded as the energy transformation efficiency . Note that 
𝐶 𝑝 is usually precomputed based on the theoretically expected 
performance of the turbine system . The wind turbine mechanical characteristics 
are described by the following equation ( where the turbine rotor friction is 
ignored ) : 𝑇 𝑚 − 𝑇 𝑔 = 𝐽 𝑑 𝜔 𝑑 𝑡 , ( 5 ) where 
𝑇 𝑔 is the load torque , and 𝐽 is the turbine inertia moment . The 
incoming wind flow rate should be equal to the outgoing flow rate to satisfy 
the mass conservation law if a control volume around a turbine is assumed . The 
outgoing wind-speed distribution and its direction strongly determine the 
turbine efficiency . Figure 2 shows the geometry of the stream tube through the 
disk . Neglecting fluid drags , the power extracted from the air stream can be w
 ritten as 1 𝑃 = ( 6 ) where 𝑉 , and 𝑉 are the flow velocity 
components along the axis of the stream tube . The power coefficient is 
obtained by nondimensionalizing the above power equation as 𝐶 , ( 7 ) where 
𝑎 is the axial induction factor . 4. Wind Tunnel Experimental Setup In this 
section , laboratory measurement techniques are discussed ; however , some of 
the methods used are conventional and require little elaboration . 4.1 . Wind 
Tunnel Facility An open-return type wind tunnel is used in the present study . 
A contrarotating model 3-blade wind turbine was placed in the boundary-layer 
wind tunnel with the goal of studying power performance , turbulence effect , 
and flow visualization . Figure 3 shows the schematic of the wind tunnel 
experimental setup where the contraction and test sections are on the right 
hand side , and the motor and fan are in the left hand side . Air enters the 
fan from the laboratory through a large gate covered by a filter , held by wire 
me
 shes . The air flow is driven by a propulsion system made of an axial fan to 
provide the dynamic pressure for compensating viscous losses . There are smooth 
glass walls on both sides of the tunnel , and access is possible through the 
plywood ceiling and floor . Any large obstruction placed within a wind tunnel 
will alter the characteristics of the flow to some degree . The wind tunnel is 
capable of generating wind speeds up to 30 m/s . This suction type wind tunnel 
has a cross-section of 0.61 m width by 0.9 m height . The tunnel has a working 
( test ) section of length 3.6 m. As the test section is the narrowest part of 
the circuit , it is also the part where the air velocity is the highest and , 
therefore , by Bernoulli’s principle , where the pressure is the lowest . The 
main distinguishing feature of this wind tunnel is that it was designed to 
produce a low level of turbulence in the test section . Power for the tunnel 
comes from a three-phase AC motor of 30 hp at 1800 rpm with
  a maximum speed of 1170 rpm , driving a 10-bladed fan of 54 inches diameter 
with blade setting of 23° , mounted in a cylindrical steel casing . To 
minimise noise and vibration , the casing is supported on rubber shock mounts 
and is connected by flexible seals to the tunnel on either side . The air speed 
does not change as the air passes through the fan . The rotational speed of the 
fan is controlled by a regulated magnetic field and solid-state power supply . 
In order to control the ambient turbulence level , turbulence manipulators are 
placed upstream of the rotor , including a fine mesh screen and an aluminum 
honeycomb section . Smoothing is provided by the fine mesh screen . The 
honeycomb plays the role of a flow straightener . When the wind turbine is 
stopped , the mean velocity over the center portion of the wind tunnel is 
uniform and almost steady . 4.2 . Instrumentation A small model SWEC with two 
blade sets of 23 cm diameter and a varying distance between the blade sets of
  7–54 cm has been built and tested over a range of operating conditions . In 
order to introduce some degree of uniformity into the way in which users of the 
wind tunnel record their data , an instrumentation system to measure and 
display a number of variables that are normally required for all experiments 
was installed . Two guide rails were used to hold the SWEC inside wind tunnel 
floor along the centreline using a steel mounting system . The steel mounting 
system ensured that the system did not move during testing . Measuring sensors 
were mounted at different locations of the setup . The upwind and downwind 
velocities are measured by pitot tubes , which use Bernoulli’s principle to 
convert pressure to velocity readings . The tubes are attached to 2 sensors to 
convert pressure in volts to velocities in m/s . For measuring the rotational 
speed of the rotor , two infrared detector and emitter units ( photogate 
sensors ) were used . They were mounted behind the rotor . Measurement
  depends largely on a data acquisition system utilizing electronic measuring 
to read instantaneous power produced by the generating system , as various 
parameters are varied on the turbine or in the environment . The parameters 
varied include the distance between the two sets of blades , blade profiles , 
number of blades , wind speeds , and size ratios . To accomplish the objective 
of this test , three aspects of experimental setup are needed : mechanical , 
electrical , and measurement software . All sensors are powered , grounded , 
and connected to the data acquisition board . All wires are shielded for 
protection against noise . Measurements are monitored directly and 
instantaneously in the Graphical User Interface ( GUI ) of LabView . The user 
enters numerical values of the blade distance , blade pitch , and blade 
diameter for the front and back and the relative humidity and temperature . The 
circuit has 5 sets of measurements on both the front and the back of the 
generating syst
 em . The voltages are measured directly from the potentiometers ; these are 
the total voltages of the circuits . The currents are obtained by measuring the 
voltages from fixed resistors and dividing that by the resistance . The power 
is the product of the voltage and the calculated current . The rpm signals go 
through a frequency measurement tool in LabView and are then multiplied by 60 
to obtain the angular velocity in revolutions per minute ( rpm ) . All lines of 
measurements are connected to the National Instruments Data Acquisition Board 
NIDAQ USB-6210 . Each line is connected to an analog pin which is fed into the 
LabView program with a USB connection to the computer . At the beginning of the 
measurement process , all sensors were checked and calibrated . The pitot tubes 
are corrected by the offset values to give zero when there is no wind in the 
tunnel . When starting the program , a path is requested for an Excel file to 
record the data . 
\ No newline at end of file

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bact/311bacte\";
 url=\"http:__vue.org.uk_carlos.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/baiw/312baiwc\";
 
url=\"http:__www.sleafordtownfc.co.uk_archives_archived_game.asp?MatchID=89&Season=2002_03.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bays/313baysn\";
 url=\"http:__www.portscathoholidays.co.uk_ShowDetails.asp?id=96.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bbgk/314bbgkl\";
 url=\"http:__www.homezonenews.org.uk_news_news_detail.asp?nid=22.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bbgl/315bbglz\";
 url=\"http:__www.benhs.org.uk_anex.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bcke/316bcked\";
 
url=\"http:__www.fancy-rats.co.uk_information_guides_guides.php?subject=ratsthatbite.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bcnk/317bcnko\";
 
url=\"http:__www.ombudsman.org.uk_improving_services_selected_cases_PCA_sc9903_c682b.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bdac/318bdacj\";
 url=\"http:__www.mml.cam.ac.uk_call_translation_toolkit_6_.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bdat/252bdat\";
 url=\"http:__www.snh.org.uk_nnr-scotland_news_detail.asp?newsID=79.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bdrl/319bdrlk\";
 url=\"http:__www.mubs.mdx.ac.uk_Conferences_BPCSR05_submission.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bdsv/320bdsvh\";
 url=\"http:__www.herts24.co.uk_flatfiles_paulpearcetributes.aspx.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/belc/321belcb\";
 url=\"http:__www.blackpresence.co.uk_phpBB2_viewtopic.php?t=97.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/beoe/322beoeg\";
 url=\"http:__www.viploan.co.uk_article_Mortgages-1212.shtml.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bfqb/323bfqbt\";
 url=\"http:__www.brainbashers.co.uk_droodlesprev.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bgck/324bgckh\";
 url=\"http:__www.photonics.org.uk_newsletter_NoticeBoard.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bgnd/325bgndn\";
 url=\"http:__www.snh.org.uk_calendar_jul.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bgpw/326bgpwt\";
 
url=\"http:__www.cv-library.co.uk_localjobs_Northamptonshire_jobs-in-Brackley.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bgvt/327bgvth\";
 
url=\"https:__secure.bfi.org.uk_features_ultimatefilm_chart_details.php?ranking=65.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bhdt/328bhdtl\";
 url=\"http:__eurocomms.co.uk_online_pr_online_pr.ehtml?o=1647.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bhtu/329bhtur\";
 
url=\"http:__www.inverness-courier.co.uk_news_fullstory.php_aid_809_Tackling_human_organ_donation_dilemma_.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bhvh/330bhvhh\";
 url=\"http:__www.sscs.bham.ac.uk_phsi_eating_bmi.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/biec/331bieca\";
 
url=\"http:__www.ombudsman.org.uk_improving_services_selected_cases_HSC_IC0107_pt1-e2242.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bjcq/332bjcqp\";
 url=\"http:__www.bfice.org.uk_index.asp?contentid=21&menuid=21.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bjyh/333bjyhd\";
 url=\"http:__www.ebe.org.uk_ccn.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bjzd/334bjzdi\";
 url=\"http:__www.northumberland.gov.uk_vg_text_northpen.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bkeb/335bkebq\";
 
url=\"http:__www.vam.ac.uk_res_cons_research_research_reports_1992_theatre_museum_index.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bknb/336bknbk\";
 url=\"http:__easyweb.easynet.co.uk_jim.shead_River-Arun.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bksu/337bksuh\";
 url=\"http:__www.lawson-cruttenden.co.uk_conveyancing.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bkvb/338bkvbo\";
 url=\"http:__www.hamradio.co.uk_acatalog_Vert_Arno_Ant.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bkxn/339bkxng\";
 url=\"http:__travel.independent.co.uk_europe_article1192096.ece.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/blun/340blunb\";
 url=\"http:__www.nsbapty.co.uk_Supp-Samp.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bmbk/341bmbka\";
 
url=\"http:__www.shakespeare-country.co.uk_swt.aspx?&cp=.._swt_&cg=_&sim=&id=487&pagetype=27.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bmwr/342bmwro\";
 url=\"http:__www.surf4wine.co.uk_Eben_Sadie.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bnaq/343bnaqt\";
 
url=\"http:__jobsearch.localgov.monster.co.uk_getjob.asp?JobID=46663258&AVSDM=2006%2D08%2D10+09%3A45%3A00&Logo=0&sort=cp&pg=1.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bnqe/253bnqe\";
 url=\"http:__www.ecodyfi.org.uk_commfirstactionplanpr.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bofv/254bofv\";
 url=\"http:__www.sefton.gov.uk_page&3630.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bopl/344boplu\";
 url=\"http:__www.dw-perspective.org.uk_dwboard_messages_112.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/botu/345botuf\";
 url=\"http:__www.landforsale-investment.org.uk_Plot-Sales.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bova/346bovad\";
 url=\"http:__www.expertcardirectory.co.uk_car-leasing-jamjar.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bped/347bpedf\";
 url=\"http:__www.nta.nhs.uk_news_020624.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bpjo/348bpjoh\";
 url=\"http:__www1.city.ac.uk_law_lawpages_Victim_Support.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bpsf/349bpsfu\";
 url=\"http:__www.industrialnetworking.co.uk_mag_v7-2_p7.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bqbn/350bqbnl\";
 url=\"http:__www.cedr.co.uk_index.php?location=_news_archive_20040628.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bqcc/351bqccv\";
 url=\"http:__www.idler.co.uk_archives_?page_id=18.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bqsw/352bqswt\";
 url=\"http:__www.donhost.co.uk_support_index.pl?page=mailboxes.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bqxq/353bqxqv\";
 
url=\"http:__backstage.bbc.co.uk_news_archives_2005_11_backstagebbccou_2.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/brcu/255brcu\";
 url=\"http:__www.motheratwork.co.uk_Health_default.asp?article=135.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/brjy/354brjyh\";
 url=\"http:__www.learningexperience.org.uk_learning_first.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bsaj/355bsajb\";
 url=\"http:__www.chortle.co.uk_edfest2006_terrysaunders.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bspm/356bspmu\";
 url=\"http:__www.cps.gov.uk_legal_section21_chapter_f.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bssg/357bssga\";
 url=\"http:__www.tropicalfishcentre.co.uk_Plants.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/btab/358btabi\";
 url=\"http:__www.bba.org.uk_bba_jsp_polopoly.jsp?d=155&a=493.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/btbb/359btbbg\";
 url=\"http:__www.burpham.surrey.sch.uk_potter.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bthk/360bthkw\";
 
url=\"http:__www.ttrb.ac.uk_viewArticle.aspx?categoryId=14542&taggingType=4&contentId=11208.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bthx/361bthxs\";
 url=\"http:__www.rvrcd.co.uk_catalogue_walker_walkerreviews.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/btlm/362btlmk\";
 
url=\"http:__www.birdtours.co.uk_tripreports_Spain_andalucia6_and-oct-03.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/btsm/363btsmp\";
 url=\"http:__www.trainingservicesindex.co.uk_newsletter_aug04.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/btsx/364btsxl\";
 
url=\"http:__www.mediaweek.co.uk_search_index.cfm?fuseaction=details&nNewsID=560539.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/btvt/365btvti\";
 url=\"http:__www.i-dj.co.uk_artists_artistspage.php?ID=204&page=3.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bukc/366bukcu\";
 url=\"http:__www.ukpages.freewire.co.uk_buying-property-continent.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bvai/367bvaif\";
 url=\"http:__www.uservision.co.uk_usability_articles_print_wud.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bvha/368bvhab\";
 
url=\"http:__agrifor.ac.uk_browse_cabi_3736cbd2e5895cf49854f8d70494bae7.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bvqz/369bvqzk\";
 url=\"http:__www.elsham.pwp.blueyonder.co.uk_cx500_oil_pump_.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bvxo/370bvxom\";
 url=\"http:__www.schools.co.uk_index.php?name=News&file=article&sid=34.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bwal/371bwale\";
 url=\"http:__www.poptel.org.uk_scgn_articles_9902_inbrief.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bwvh/372bwvhf\";
 url=\"http:__www.tameside.gov.uk_tmbc6_cycling_withoutmycar.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bycu/373bycul\";
 
url=\"http:__www.mubs.mdx.ac.uk_Staff_Personal_pages_Ifan1_Booth_Notebooks.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/byeg/374byegq\";
 
url=\"http:__www.thehealthierlife.co.uk_article_3603_reduce-cancer-reoccurrence.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bykk/375bykkm\";
 url=\"http:__jobs.leaddiscovery.co.uk_job.aspx?jid=11535&cd=1.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bynx/376bynxp\";
 url=\"http:__www.syscom.plc.uk_solutions_distrib.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bytv/377bytvq\";
 
url=\"http:__personalfinance.iii.co.uk_articles_articledisplay.jsp?section=Banking&article_id=64923.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bzam/378bzamy\";
 url=\"http:__www.evolutec.co.uk_06_chairman.asp?thesub=6.0.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bzfk/379bzfki\";
 url=\"http:__www.truststfc.co.uk_meeting_27_09_2006.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bzic/380bzicn\";
 
url=\"http:__www.macintyrecharity.org.uk_transition_personal_experiences_michael.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bziz/381bzizf\";
 
url=\"http:__union.ic.ac.uk_scc_icsf_library_library_history_library_history_3.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bzlq/382bzlqz\";
 url=\"http:__www.socialistunitynetwork.co.uk_news_g8jepps.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bzor/383bzors\";
 url=\"http:__www.weirdwiltshire.co.uk_250703.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bzsx/384bzsxn\";
 
url=\"http:__www.tsha.nhs.uk_modernising-healthcare-in-trent_the-local-supervising-authority-midwifery_lsa-guidelines_maternal-deaths.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/bzxm/385bzxmk\";
 
url=\"http:__www.buildingproductexpert.co.uk_ExpandedEntries_expandedentry.asp?cid=212046&cname=Mark+Simpkin+Ltd&frmBPE=&frmCD=N&mopt=dpe&dpid=2302.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cahe/386cahel\";
 url=\"http:__www.siba.co.uk_about.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cbfp/387cbfpd\";
 
url=\"https:__secure.advanceperformance.co.uk_acatalog_Men_s_Wave_Nirvana_2_Mizuno_Running_Shoes.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cbuk/388cbukt\";
 url=\"http:__www.deafnessresearch.org.uk_?lid=1944&tmpl=ddmainprint.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cbxv/389cbxvh\";
 url=\"http:__www.port.ac.uk_departments_services_campusenvironment_.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cckm/390cckmj\";
 
url=\"http:__www.dillington.co.uk_day_course_details.asp?ED=Arts+and+Crafts&offset=66.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ccnl/256ccnl\";
 
url=\"http:__www.esporta.co.uk_Clubs_Mids+%26+East+Anglia_Oxford_Promotions_Member%27s+Forum_!+!_CLASS_Advert_DBID_17ea4c66d7bd2c0aeb4513c89cb01afd.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cdfc/391cdfcq\";
 url=\"http:__www.fst.rdg.ac.uk_news-archive-2004-11.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cdng/392cdngg\";
 url=\"http:__www.aslib.co.uk_training_careers_9.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cdqv/393cdqvl\";
 
url=\"http:__www.ncl.ac.uk_undergraduate_course_A106_profile_Can-I-spend-time-on-an-elective.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cedi/394cedie\";
 url=\"http:__www.incomesdata.co.uk_europe_duediligence.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cegc/395cegcr\";
 url=\"http:__www.pennine.demon.co.uk_NPC_1982_MEXICOSP.HTM.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/celp/396celpi\";
 url=\"http:__www.hsl.gov.uk_publications_car.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cest/397cesta\";
 url=\"http:__www.baronage.co.uk_bphtm-01_const-02.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cewp/398cewpo\";
 
url=\"http:__www.assureweb.co.uk_public_Main.asp?Params=65C5B21F70C4D12078C6116FD0FD01ED50A0B7BBFBEBBDC7F85DB3C8C41964AFCF3977972B54AAC68E8AE50A7AB1888C6DEE8379864B7E79F21CA9025DF7DD55D14C83960FBA06562FFBA3B67013B5558FE96AD7.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cfec/399cfecj\";
 url=\"http:__www.myleedsjobs.co.uk_jobdetails-11834.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cfuh/400cfuhe\";
 url=\"http:__www.paperairplanes.co.uk_orplan.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cfvq/401cfvqe\";
 url=\"http:__www.eca.ac.uk_tacitus_news.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cfwj/402cfwje\";
 url=\"http:__www.casino-avenue.co.uk_2004_06_duuuuh.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cfwt/403cfwtc\";
 url=\"http:__www.dwp.gov.uk_lifeevent_penret_penreform_5_reg.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cfxp/404cfxpi\";
 url=\"http:__www.framearch.co.uk_projects_T5_excavation.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cgbz/405cgbzt\";
 url=\"http:__www.redcross.org.uk_section.asp?id=49633.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cgel/406cgelv\";
 url=\"http:__www.forestforum.org.uk_jobs_forestsmonitor2001.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cgjx/407cgjxf\";
 url=\"http:__www.twickenham-museum.org.uk_kids_detail.asp?ContentID=189.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cgzy/257cgzy\";
 url=\"http:__www.offthetelly.co.uk_interviews_markwright.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/chns/408chnsk\";
 url=\"http:__www.sitcom.co.uk_tlc_characters.shtml.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/chny/409chnyq\";
 url=\"http:__www.aberdeen-grampian.co.uk_whiskycountry_ess_walk.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cicu/410cicug\";
 url=\"http:__www.shipleygreenparty.org.uk_sgpnewsarticle20051222a.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cipr/411ciprs\";
 url=\"http:__www.employment-solicitors.co.uk_Employer1.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/citx/412citxq\";
 url=\"http:__www.princessquare.co.uk_news_Food_Sounds_So_Good_at_DArcys_.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cjia/413cjiau\";
 url=\"http:__www.sweetsforu.co.uk_shipping.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cjjc/414cjjcl\";
 url=\"http:__www.e.volve.org.uk_Listings.aspx?index=387&item=2929.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cjqc/415cjqcv\";
 url=\"http:__www.verko.co.uk_product.aspx?catno=53&prod=HCAA6241.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cjta/416cjtan\";
 
url=\"http:__www.romancesouthwest.co.uk_main_en_att-provider-ROMA_6913.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cjto/417cjton\";
 url=\"http:__www.art-works.org.uk_artworks_z030703b.shtml.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cjtu/418cjtub\";
 url=\"http:__www.jr2.ox.ac.uk_bandolier_booth_miscellaneous_wristgang.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cjwg/419cjwgx\";
 url=\"http:__www.changingdiabetes.co.uk_view.asp?ID=92.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ckcu/420ckcux\";
 url=\"http:__www.lashelmets.co.uk_las%20new%20bionix%20page.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ckiv/421ckivb\";
 
url=\"http:__www.cb-com.co.uk_listgen.asp?layout=results-brief.asp&page=37&sql=&sortup=sorttitle&bookstatus=OK.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ckju/422ckjul\";
 url=\"http:__www.wildlifetrust.org.uk_cheshire_proj_harvest_survey.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ckrq/423ckrqy\";
 url=\"http:__www.enemydown.co.uk_clancomments.php?id=35113.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ckym/424ckymf\";
 url=\"http:__www.scis.org.uk_search_menu_new.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/clez/425clezz\";
 
url=\"http:__www.uk-wholesaler.co.uk_softbook_clickbankmembership_clickbankprotector.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/clsl/426clslk\";
 url=\"http:__www.blewa.co.uk_project5_teachers_T5-0-1.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cmna/427cmnal\";
 
url=\"http:__www.politicalwizard.co.uk_administration_childsocnew_index.php?category=campaigns&c=i&uid=2130.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cnos/428cnosx\";
 url=\"http:__www.kent-ccc.co.uk_news_story.php?id=660.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cnoz/429cnozg\";
 
url=\"http:__beehive.thisisexeter.co.uk_default.asp?WCI=SiteHome&ID=9908&PageID=56638.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cnqw/430cnqwk\";
 
url=\"http:__bookshop.blackwell.co.uk_jsp_id_0340894342_Divine_Madness.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cntu/431cntup\";
 url=\"http:__www.newble.co.uk_chalmers_innes.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cody/432codym\";
 url=\"http:__www.searchenginespy.co.uk_article0027.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/coip/433coipg\";
 
url=\"http:__www.scottishcorpus.ac.uk_corpus_search_document.php?documentid=1211.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cplj/434cpljw\";
 
url=\"http:__www.peterhead.org.uk_familyheritage_forum_topic.asp?TOPIC_ID=26&.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cpnw/435cpnwa\";
 
url=\"http:__www.frenchhouserestoration.co.uk_franceproperty150to200_propertyandhousesforsalelimousinabn0509263.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cqha/436cqham\";
 
url=\"http:__www.rcpsych.ac.uk_college_faculties_liaison_documents_servicedevelopment_managerialfacilities.aspx.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cqng/437cqngo\";
 
url=\"http:__www.tlchm.bris.ac.uk_safety_various_rass_kmweb_safety_msds.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cqrd/438cqrdh\";
 url=\"http:__www.msabritain.co.uk_index.php?id=23&L=3&article=13.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/crkw/439crkwd\";
 
url=\"http:__www.frontier.net.uk_FAQsearch.asp?search_strFields=strMetaKeywords&search_strType=FAQS&search_strAreaNo=1053,2011&strKeyword=PS2006_4_3.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/crsr/440crsrx\";
 url=\"http:__www.itreviews.co.uk_games_g232.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/csah/441csahf\";
 url=\"http:__www.petergasston.co.uk_2002_09_to-quote-the-four-seasons.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/csct/442csctp\";
 
url=\"http:__www.kevinmayhew.co.uk_Mobile_default.aspx?group_id=16538&article_id=21979.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/csmw/443csmwd\";
 
url=\"http:__www.ceac.aston.ac.uk_research_staff_jpf_papers_paper26_index.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/csqc/444csqcw\";
 url=\"http:__www.all-energy.co.uk_newsletter45.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/csxj/445csxju\";
 url=\"http:__socialistworker.org.uk_article.php?article_id=8138.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ctcu/446ctcuk\";
 url=\"http:__www.setdanceteacher.co.uk_newmarketmez.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ctfd/447ctfdn\";
 url=\"http:__www.ebi.ac.uk_interpro_DisplayIproEntry?ac=IPR002824.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ctgo/448ctgol\";
 url=\"http:__www.garthyfog.co.uk_mawddach_valley.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cuua/258cuua\";
 url=\"http:__www.swcc.org.uk_caving_expeditions_jura05_jura_circ1.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cvbw/449cvbwe\";
 url=\"http:__www.oca-online.co.uk_viewnews.cfm?news_id=177.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cvgk/450cvgkz\";
 
url=\"http:__www.chichester.co.uk_mk4CustomPages_CustomPage.aspx?PageID=24163&sectionID=4585.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cvqe/451cvqer\";
 
url=\"http:__www.webstar.co.uk_~musnews_news_search.php?search=&start=12080.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cwbp/452cwbpn\";
 url=\"http:__www.linc4info.org.uk_cms_pages_sitemap.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cwcq/453cwcqx\";
 url=\"http:__www.ccp4.ac.uk_courses_IUCr2005_index.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cwde/454cwdel\";
 url=\"http:__www.perceptive-engineering.co.uk_html_training.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cwhb/455cwhbc\";
 url=\"http:__www.lanpac.co.uk_csi.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cwma/456cwmaf\";
 url=\"http:__www.chisenhale.org.uk_html_files_501_project_info.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cwzw/457cwzwc\";
 url=\"http:__www.ilnpictures.co.uk_showpage.asp?showdocumentid=196.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cxnk/458cxnko\";
 url=\"http:__www.starlink.rl.ac.uk_star_docs_sun232.htx_node17.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cxxy/459cxxyp\";
 
url=\"http:__www.publications.parliament.uk_pa_cm199900_cmhansrd_vo000405_debtext_00405-07.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/cyyc/460cyyci\";
 
url=\"http:__www.dwalker.pwp.blueyonder.co.uk_Fasti%20V.2_p.%20278%20PRESBYTERY%20OF%20PENPONT%20p.%20672.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/czqo/461czqoj\";
 url=\"http:__www.wessingtoncryogenics.co.uk_serv01.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/czve/462czvea\";
 
url=\"http:__www.aylesburyvale.gov.uk_avdc_content_index.jsp?contentid=1999276669.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dagx/463dagxc\";
 url=\"http:__www.stratford-upon-avon.co.uk_static_481.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/danu/464danug\";
 url=\"http:__www.synergygroup.co.uk_office-support-recruitment_.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/daqq/465daqqs\";
 
url=\"http:__www.sexshop365.co.uk_catalog_product_info.php?products_id=2981.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/daws/466dawss\";
 url=\"http:__www.honda-racing.co.uk_fourwheels_formula1_article.asp?a=1327.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dbbq/467dbbqw\";
 
url=\"http:__www.bioinformatics.leeds.ac.uk_~david_docs_api_javax_swing_JSplitPane.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dbhi/468dbhia\";
 url=\"http:__www.ateonline.co.uk_60_66_67_articles_7335.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dbjz/469dbjza\";
 url=\"http:__www.iae.co.uk_news_designedforthejob.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dbzi/470dbzic\";
 url=\"http:__www.oaa.org.uk_Case_Studies_studies_Ford3_Ford3.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dccp/471dccpz\";
 url=\"http:__www.jamesgourmetcoffee.co.uk_product.php?xProd=21&xSec=22.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dcon/472dcont\";
 url=\"http:__www.uservision.co.uk_usability_aboutus_usability_aboutus.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dcqe/473dcqen\";
 
url=\"http:__www.learnenglish.org.uk_crazyworld_series2_crazyworld_story.asp?latestchapter=12&subarea=11.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dcsp/474dcspt\";
 url=\"http:__www.guysherratt.co.uk_pages_searchdetails.asp?ID=776.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dddx/475dddxm\";
 url=\"http:__www.menshealthforum.org.uk_userpage1.cfm?item_id=1913.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ddqu/476ddquk\";
 url=\"http:__www.schoolhouse.org.uk_law_not_enrolled.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/deck/477decku\";
 
url=\"http:__www.womenintothenetwork.co.uk_page_calendar_archive_article.cfm?articleId=52.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/decp/478decpr\";
 url=\"http:__www.lathes.co.uk_beaver_page5.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dehf/479dehfj\";
 url=\"http:__www.hasslefreeminiatures.co.uk_rules.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/deqg/480deqgv\";
 
url=\"http:__www.newworknetwork.org.uk_modules_event_viewevent.php?eveid=109.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/derj/481derjo\";
 
url=\"http:__www.cheatgenius.co.uk_cheats_641_Gamecube-cheats_Gamecube-(hardware)-Cheats.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dfnm/482dfnmt\";
 url=\"http:__www.omega.co.uk_ppt_pptsc.asp?ref=LE902.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dfoo/483dfoor\";
 
url=\"http:__www.fulcrum-anglican.org.uk_forum_poster.cfm?sort=creatasc&poster=101.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dfvl/484dfvlp\";
 url=\"http:__www.la-hq.org.uk_directory_prof_issues_blreview.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dfzy/485dfzyc\";
 
url=\"http:__www.billyarmstrong.co.uk_050613_public_urged_to_back_london's_olympic_2012_bid.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dgmo/486dgmor\";
 
url=\"http:__chat.dailymail.co.uk_dailymail_threadnonInd.jsp?forum=106&thread=9757638&message=11724737.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dgti/487dgtix\";
 
url=\"http:__www.thebookpeople.co.uk_webapp_wcs_stores_servlet_product_10001_10051_20553_100_10012_10010_category_10010.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dgws/488dgwsx\";
 url=\"http:__www.bjhc.co.uk_news_industry_2005_ind505016.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dgzt/489dgztd\";
 url=\"http:__www.uea.ac.uk_eas_events_litfestspr04.shtml.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dheb/490dhebu\";
 url=\"http:__www.kimberry.co.uk_Dotnetlectures_Index.aspx.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dheg/491dhegq\";
 url=\"http:__www.lathes.co.uk_wolfjahnmiller_page2.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dhem/492dhemc\";
 url=\"http:__www.humanism.org.uk_site_cms_newsarticleview.asp?article=2173.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/dhow/493dhowh\";
 url=\"http:__www.anweb.co.uk_l_04_c3_c3a10.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/eizz/259eizz\";
 
url=\"http:__www.royalhigh.edin.sch.uk_content_subject_modernlanguages_course_s1s2.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/fodw/260fodw\";
 url=\"http:__www.law.warwick.ac.uk_ltj_4-1m.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/fvms/261fvms\";
 url=\"http:__www.citizenshipfoundation.org.uk_main_news.php?n20.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/guzs/262guzs\";
 url=\"http:__www.arnside-online.co.uk_care.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/hhvw/263hhvw\";
 url=\"http:__www.oarsport.co.uk_products_leatherman_micra.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/hkay/264hkay\";
 url=\"http:__www.bnp.org.uk_columnists_docdiary2.php?docId=103.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/hzbi/265hzbi\";
 url=\"http:__www.familiesonline.co.uk_article_articleview_1733_1_153.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/kbmr/266kbmr\";
 url=\"http:__www.ladybird-survey.pwp.blueyonder.co.uk_P_mugo.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/kjkq/267kjkq\";
 url=\"http:__www.free-internet.co.uk_email_sendmail.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/kzix/268kzix\";
 url=\"http:__www.cs.bham.ac.uk_resources_ums_PythonDoc_api_threads.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/lfwv/269lfwv\";
 
url=\"http:__www.thisismoney.co.uk_news_columnists_article.html?in_article_id=405873&in_page_id=50002.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ljrm/270ljrm\";
 url=\"http:__www.learningservices.gcal.ac.uk_synergy_03_scwbl.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/lptf/271lptf\";
 
url=\"http:__www.nisp.co.uk_pooled_articles_BF_NEWSART_view.asp?Q=BF_NEWSART_95582.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/lqyu/272lqyu\";
 url=\"http:__www.mountainsoftware.co.uk_printpage.asp?REF=_group.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/mail/273mail\";
 url=\"http:__www.dba.org.uk_aboutdba_chriswood.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/mpxl/274mpxl\";
 url=\"http:__www.surefish.co.uk_culture_books_0804_110804_food_books.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/mtps/275mtps\";
 url=\"http:__www.newtsnni.gov.uk_actionplan_04b.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/mtqb/276mtqb\";
 
url=\"http:__www.hummersknott.org.uk_Stud_Res_Info_Tec_Info_ICT_KS3_databases_relational_databases.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/ndi\"/247ndi\";
 
url=\"http:__www.stockportmbc.gov.uk_secondary_offerton_pages_SchemesofWork_KS4_skillswl.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/odv\"/248odv\";
 
url=\"http:__www.find-me-a-gift.co.uk_gifts-for-men_unusual-gadgets_mood-light-tile.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/omzz/277omzz\";
 
url=\"http:__www.swindon-speedway.co.uk_modules.php?op=modload&name=News&file=article&sid=175&mode=thread&order=0&thold=0.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/oojb/278oojb\";
 url=\"http:__www.engender.org.uk_justice.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/opab/279opab\";
 url=\"http:__www.heros.org.uk_home_sub.asp?page=2.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/orle/280orle\";
 url=\"http:__www.bsg.org.uk_clinical_prac_mar_05_mar05_08.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/osor/281osor\";
 url=\"http:__www.cdp.bham.ac.uk_About_CDP_methods.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/pabf/282pabf\";
 url=\"https:__www.cambs-police.co.uk_caminfo_blueprint_articles.asp?ID=807.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/pkbn/283pkbn\";
 url=\"http:__www.hmso.gov.uk_legislation_scotland_acts2002_20017--b.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/pxnz/284pxnz\";
 url=\"http:__www.dog-pictures.co.uk_dog-pictures_shiba_inu.shtml.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/qsym/285qsym\";
 url=\"http:__www.strawbale-building.co.uk_index.php?page=faq.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/qvpo/286qvpo\";
 url=\"http:__company.monster.co.uk_londonunderuk_tfl_our_careers.asp.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/rfk\"/249rfk\";
 url=\"http:__www.environment.bham.ac.uk_extindex.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/rfkm/287rfkm\";
 
url=\"http:__www.artshole.co.uk_exhibitions_Aug%2006%2004_James%20Cauty.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/rivm/288rivm\";
 url=\"http:__www.sincuser.f9.co.uk_050_lastwrd.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/rkwj/289rkwj\";
 url=\"http:__www.ocdaction.org.uk_skin-picking.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/rlwz/290rlwz\";
 url=\"http:__www.greenparty.org.uk_news_2033.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/rnce/291rnce\";
 url=\"http:__www.lel.ed.ac.uk_linguist_issues_17_17-229.html.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/sbdw/292sbdw\";
 
url=\"http:__sccplugins.sheffield.gov.uk_press_news_aRelease.asp?akey=2026&Mon=01_07_2004.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/sbqp/293sbqp\";
 url=\"http:__www.lpt.nhs.uk_service5.php.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/seaz/294seaz\";
 url=\"http:__www.panos.org.uk_resources_reportdetails.asp?id=1039.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/smar/295smar\";
 url=\"http:__www.arctech.co.uk_siemens_hosted_exchange.htm.txt"
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/opennlp-sandbox/blob/1f97041b/"opennlp-similarity/src/test/resources/style_recognizer/txt/sqdo/296sqdo\";
 url=\"http:__gimbo.org.uk_archives_2006_01_chomsky_intervi.html.txt"
----------------------------------------------------------------------

Reply via email to