Page 3 of 4 FirstFirst 1234 LastLast
Results 21 to 30 of 34

Thread: Data Mining the London Gazette Website

  1. #21
    Join Date
    Nov 2007
    Location
    Bewdley, UK
    Posts
    2,700
    Thanks
    0
    Thanked 1 Time in 1 Post

    Default 1947 - new year honours - final ww2 related decorations

    37835 1 30 1947-01-01 supplement
    37836 31 34 1947-01-01 supplement
    37837 35 36 1947-01-01 supplement
    37838 37 92 1947-01-01 supplement
    37839 93 94 1947-01-01 supplement
    37840 95 102 1947-01-01 supplement
    37841 103 120 1947-01-01 supplement
    The Intellectual Property contained in this message has been assigned specifically to this web site.
    Copyright Ross McNeill 2015/2018 - All rights reserved.

  2. #22
    Join Date
    Nov 2007
    Location
    North Tynedale, Northumberland
    Posts
    420
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Default

    That is quite a list. I wrote a data scraper to extract the details from the RAF Vault for WW1 airmen, and this took around 15 hours to process 26,593 entries. Most of that time was spent reading the URL and pulling out the information, very little post processing was involved, other than importing the final CSV file into a database. The text file alone was 6.2mb in size.

    I'm certainly interested in taking a look at it when I have the time.

    I have, from a very old data processing package I wrote for a former employer, a pattern matching algorithm too, which is quite good for quickly searching large text based documents and picking out useful data. I am sure that can be brought into play somewhere along the line.
    Last edited by JimCorbett1977; 31st March 2018 at 10:54.

  3. #23
    Join Date
    Jan 2008
    Location
    Delaware, USA
    Posts
    920
    Thanks
    173
    Thanked 16 Times in 11 Posts

    Default

    Ross, Jim, et al

    Before you start downloading the list that Ross has posted, LG has a better way of downloading them - if you hit a URL like https://www.thegazette.co.uk/London/...37245/data.pdf . you can download the entire pdf. Naturally downloading the entire PDF also downloads a lot of unncessary junk. But working with a smaller list of files is probably easier.

    Using the search, i came up with a list of 1197 issues that had "Royal Air Force" word in it or related references - and all of them are now downloaded and and also converted to text files (using existing adobe conversion that is already in the file). If folks want to chime in with the effort - ping me and i will add provide you the data.

    This is what i came up with I will reconcile this with Ross's list and see if any issues are missing. (ADDED Later : Realised now that Ross list is the entire LG run during WW2)



    34662 34945 35235 35525 35798 36047 36312 36512 36741 36985 37242 37508
    34674 34949 35238 35528 35801 36049 36314 36514 36744 36987 37243 37509
    34679 34951 35241 35531 35804 36051 36315 36516 36746 36989 37244 37511
    34683 34954 35244 35534 35806 36052 36319 36520 36748 36991 37246 37513
    34684 34958 35247 35535 35807 36054 36323 36521 36751 36994 37249 37515
    34687 34960 35251 35539 35809 36056 36326 36522 36755 36996 37250 37516
    34694 34964 35254 35541 35813 36059 36327 36524 36756 36997 37252 37518
    34699 34966 35257 35542 35817 36060 36329 36525 36757 36998 37255 37523
    34700 34970 35260 35544 35819 36063 36330 36530 36759 37000 37261 37526
    34705 34976 35263 35547 35820 36067 36333 36531 36761 37001 37264 37530
    34708 34977 35266 35551 35822 36070 36335 36532 36763 37003 37267 37532
    34711 34978 35270 35552 35824 36073 36336 36534 36766 37007 37269 37534
    34712 34982 35273 35557 35825 36075 36340 36536 36768 37009 37272 37535
    34713 34984 35276 35560 35827 36077 36341 36541 36770 37011 37273 37538
    34718 34986 35279 35561 35829 36079 36342 36542 36772 37017 37276 37541
    34721 34987 35283 35563 35831 36083 36344 36543 36776 37021 37278 37545
    34724 34989 35284 35565 35835 36084 36346 36544 36777 37022 37280 37546
    34727 34993 35287 35569 35837 36089 36348 36545 36778 37024 37282 37548
    34733 34996 35288 35570 35839 36091 36351 36547 36780 37025 37286 37553
    34738 34998 35291 35574 35841 36092 36352 36549 36781 37026 37289 37554
    34742 35001 35292 35575 35843 36094 36354 36550 36782 37029 37292 37556
    34744 35003 35295 35577 35845 36100 36356 36552 36783 37030 37295 37560
    34747 35005 35297 35580 35849 36104 36357 36554 36784 37034 37298 37561
    34752 35009 35301 35583 35855 36105 36361 36555 36787 37037 37304 37563
    34756 35010 35304 35585 35856 36108 36362 36556 36792 37042 37307 37568
    34757 35013 35305 35586 35858 36109 36363 36558 36793 37044 37309 37571
    34758 35014 35309 35592 35861 36113 36365 36560 36794 37046 37312 37576
    34760 35015 35312 35597 35864 36116 36368 36562 36795 37050 37315 37578
    34765 35019 35318 35598 35866 36119 36369 36565 36798 37053 37317 37581
    34767 35020 35319 35605 35868 36122 36373 36566 36800 37054 37323 37583
    34769 35022 35322 35606 35871 36127 36374 36570 36802 37055 37325 37584
    34771 35028 35326 35609 35873 36131 36376 36571 36804 37057 37327 37587
    34773 35029 35329 35614 35874 36135 36378 36577 36806 37063 37333 37588
    34776 35032 35330 35615 35876 36137 36380 36578 36809 37067 37334 37590
    34778 35037 35334 35618 35878 36138 36381 36581 36811 37071 37336 37593
    34780 35039 35335 35621 35881 36140 36385 36585 36814 37074 37342 37594
    34782 35042 35338 35624 35883 36141 36386 36588 36816 37078 37343 37597
    34784 35045 35341 35625 35885 36144 36387 36590 36819 37081 37345 37598
    34786 35048 35343 35628 35888 36145 36389 36592 36820 37083 37348 37602
    34788 35049 35346 35630 35891 36147 36391 36594 36821 37087 37351 37605
    34790 35054 35350 35632 35894 36148 36392 36597 36823 37093 37354 37609
    34792 35057 35353 35634 35900 36152 36393 36599 36825 37095 37356 37610
    34794 35059 35356 35638 35904 36157 36396 36604 36826 37097 37362 37614
    34795 35060 35358 35643 35911 36158 36397 36607 36827 37099 37365 37617
    34799 35065 35361 35644 35912 36161 36399 36609 36830 37100 37366 37619
    34801 35068 35364 35646 35914 36165 36401 36612 36834 37104 37367 37622
    34803 35071 35366 35647 35916 36166 36402 36614 36836 37105 37370 37623
    34805 35073 35367 35650 35919 36170 36405 36617 36838 37106 37371 37625
    34807 35076 35370 35652 35921 36175 36406 36618 36839 37108 37373 37628
    34810 35079 35373 35654 35923 36176 36407 36622 36842 37109 37375 37631
    34814 35083 35378 35658 35925 36178 36410 36626 36844 37114 37377 37634
    34817 35087 35383 35659 35927 36183 36412 36629 36848 37118 37379 37637
    34822 35090 35386 35661 35930 36184 36414 36630 36852 37119 37382 37638
    34824 35094 35387 35667 35933 36187 36417 36631 36855 37120 37389 37639
    34826 35097 35389 35669 35936 36189 36418 36633 36857 37122 37393 37641
    34828 35101 35391 35671 35938 36192 36420 36636 36859 37124 37398 37642
    34829 35102 35392 35675 35940 36196 36422 36639 36862 37125 37400 37645
    34831 35106 35394 35678 35943 36201 36423 36643 36863 37131 37402 37647
    34833 35107 35398 35679 35946 36202 36425 36645 36865 37133 37403 37651
    34838 35110 35402 35682 35947 36207 36428 36648 36866 37135 37405 37655
    34840 35114 35403 35686 35949 36208 36429 36651 36867 37137 37407 37656
    34842 35116 35404 35687 35954 36211 36430 36653 36869 37141 37408 37659
    34844 35119 35408 35691 35956 36215 36432 36656 36871 37144 37410 37663
    34847 35124 35413 35692 35958 36216 36436 36657 36878 37146 37412 37666
    34849 35127 35414 35694 35961 36219 36438 36660 36879 37150 37414 37668
    34852 35131 35416 35696 35963 36223 36439 36662 36882 37153 37418 37670
    34854 35134 35419 35699 35964 36226 36440 36664 36888 37155 37420 37672
    34856 35139 35424 35704 35966 36230 36442 36665 36891 37157 37423 37676
    34859 35141 35425 35707 35973 36231 36443 36667 36893 37160 37427 37679
    34860 35142 35426 35709 35976 36235 36447 36668 36896 37164 37429 37681
    34862 35145 35430 35712 35979 36241 36448 36670 36898 37165 37431 37685
    34864 35148 35434 35713 35981 36243 36449 36671 36900 37166 37432 37687
    34866 35151 35435 35717 35983 36245 36451 36672 36902 37168 37435 37691
    34870 35152 35439 35719 35986 36246 36453 36674 36906 37171 37438 37695
    34873 35158 35440 35721 35989 36248 36455 36681 36909 37174 37443 37697
    34876 35161 35444 35725 35992 36250 36458 36682 36913 37177 37445 37700
    34878 35165 35446 35727 35996 36254 36459 36683 36916 37179 37447 37703
    34881 35168 35451 35728 35998 36258 36462 36685 36919 37182 37449 37705
    34884 35171 35454 35729 35999 36260 36463 36686 36921 37183 37452 37708
    34887 35173 35460 35731 36002 36261 36467 36688 36923 37186 37453 37710
    34889 35176 35463 35736 36004 36263 36468 36692 36930 37190 37456 37712
    34892 35183 35467 35741 36009 36264 36472 36694 36931 37193 37457 37713
    34893 35184 35475 35747 36015 36267 36475 36701 36933 37196 37460 37716
    34896 35187 35476 35750 36016 36271 36479 36702 36935 37198 37463 37719
    34898 35190 35479 35752 36017 36272 36480 36703 36941 37200 37465 37721
    34900 35193 35483 35755 36020 36276 36481 36705 36943 37206 37468 37724
    34903 35195 35486 35757 36022 36280 36483 36706 36945 37208 37472 37727
    34904 35196 35491 35759 36024 36285 36484 36709 36953 37210 37479 37729
    34906 35199 35494 35765 36027 36286 36488 36712 36957 37212 37480 37731
    34910 35203 35497 35768 36029 36287 36489 36713 36960 37215 37482 37734
    34912 35204 35498 35770 36030 36289 36490 36716 36963 37218 37486 37735
    34915 35206 35502 35773 36032 36291 36492 36719 36967 37221 37489 37741
    34920 35208 35503 35775 36033 36294 36494 36722 36969 37226 37493 37744
    34923 35213 35507 35777 36034 36296 36499 36724 36971 37227 37495 37745
    34927 35217 35508 35780 36035 36299 36500 36726 36974 37228 37497
    34929 35219 35510 35784 36036 36302 36501 36728 36975 37231 37498
    34932 35222 35514 35787 36039 36307 36503 36729 36976 37233 37501
    34935 35225 35515 35791 36041 36308 36504 36732 36978 37234 37502
    34937 35228 35518 35793 36042 36309 36510 36734 36980 37237 37504
    34940 35232 35524 35795 36044 36310 36511 36736 36982 37240 37506

    I would request members to do a Quality check of this list - and if they have an issue number that has RAF officers data that is NOT in this list, then add it to this thread
    Last edited by Jagan; 31st March 2018 at 14:00.

  4. #24
    Join Date
    Jan 2008
    Location
    Delaware, USA
    Posts
    920
    Thanks
    173
    Thanked 16 Times in 11 Posts

    Default

    Quote Originally Posted by Franek Grabowski View Post
    I understand that Jagan is not after another LG search tool but rather to import the data in single process and then convert into a database. Hopefully, service numbers should be less prone to errors than names, and following conversion any errors and gaps should be plainly visible. Making it interactive, it shall be possible to correct errors and fill gaps by users. As Peter notes, I would not expect any new OCR of the files, so any improvements of the database or search tool will not solve the problem.
    Of course, such a database could be then ammended with data from other databases, like BMD, etc.


    I have ended up with about 1197 text files.. the text files need a little fixing manually to remove errors and maek them better, Automation got me to getting the PDFs and converting them to text.. I can also extract names and numbers in 70% of the cases (Where the number is on the same line as the name). but others - where the name wraps into the second line will be missed.

    If enough volunteers are there - we can start a project to quality check the 1197 text files - and then run the automation scripts .. we may end up with a pretty good DB.

    Leverage sharing tools like dropbox, google sheets etc..

  5. #25
    Join Date
    Nov 2007
    Location
    Reading, Berkshire, UK
    Posts
    3,605
    Thanks
    3
    Thanked 12 Times in 12 Posts

    Default

    If you want relatively computer-savvy (there are those who will disagree!) to be Pilot-Program cannon-fodder then I have the time. (This vital work will take precedence over gardening!)
    Take the same slice of data, try to disembrangle it various ways - with (a) the same set of volunteers, and/or (b) a different set of volunteers. Best way to knock the rough corners off a system!
    I was impressed how quickly the WW1 Cas Files were transcribed/checked on the RAFM site. BUT(!) there has to be ONE person i/c (i.e. no cttees!), and only ONE set of rules!
    We just need a few more 'volunteers'! If we don't get enough then I still have some unpublished information/stories/scuttlebutt that might 'persuade' some laggards to 'volunteer'!
    We can do it!
    HTH
    Peter Davies
    Meteorology is a science; good meteorology is an art!
    We might not know - but we might know who does!

  6. #26
    Join Date
    Jan 2008
    Location
    Delaware, USA
    Posts
    920
    Thanks
    173
    Thanked 16 Times in 11 Posts

    Default

    To give you an idea of how the data conversion happens.

    this is the source PDF file

    https://www.thegazette.co.uk/London/...34998/data.pdf

    When using a straight forward copy to text conversion - the output file is this

    https://docs.google.com/document/d/1...it?usp=sharing

    The above text file needs everything that is not in yellow highlighted to be deleted - removed (and there are 1197 text files like this) - This is where the volunteers come in first - review the text files to remove the excess information

    The RAF stuff that is relevant is only in page 16-17-18 - highlighted in yellow.

    Now that forms the raw feed for an automation script - and it extracts the following -



    Squadron Leader Hugh Lockhart MAXWELL^25046|, No
    18 Squadron
    Awarded a Bar to the DistinguishedFlying Cross


    Acting Wing Commander John NicholasHaworth WHITWORTH, D
    F
    C
    |26249|,No
    10 Squadron


    Acting Flight Lieutenant John WilliamSTEPHENS, D
    F
    C
    |40028|, No
    107Squadron
    Awarded the Distinguished Flying Cross
    "Wing Commander Sydney Osborne BUFTON|24184|, No
    10 Squadron
    Squadron Leader Stanley Sinclair BERTRAM|34078|, No
    99 Squadron
    Squadron Leader James Ellis McCoMB |90352|,Auxiliary Air Force, No
    611 Squadron
    Squadron Leader Wynne Somers GoodrichMAYDWELL |25075|, No
    53 Squadron


    Acting Squadron Leader John Stuart BARTLETT[|3437| No
    58 Squadron
    Acting squadron Leader John Noel MAHLER|37452|, No
    15 Squadron
    Acting Squadron Leader Douglas PARKER|37009|, No
    44 Squadron
    Acting Squadron Leader Richard KempWILDEY |37437|, No
    78 Squadron
    [Flight Lieutenant John Leighton BECK |37785|,No
    214 Squadron
    Flight Lieutenant Roderick HugK McCoNNELL|33268|
    Flight Lieutenant Louis Douglas WILSON|33295|-Acting Flight Lieutenant Richard SydneyEdward ALDRIDGE |70836|, No
    61Squadron
    Acting Flight Lieutenant Charles Brian BEST|41366|, No
    40 Squadron
    Acting Flight Lieutenant John Colpoys BREESE|33303|
    NO- 82 Squadron
    Acting Flight Lieutenant Clair Mansell MayburyGRECE |36168|, No
    59 Squadron
    Acting Flight Lieutenant Thomas MichaelHORGAN |40114|, No
    18 Squadron
    Acting Flight Lieutenant Cecil Dunn MILNE|33273|, No
    212 Squadron
    Acting Flight Lieutenant David StewartROBERTSON |40141|, No
    78 Squadron
    Acting Flight Lieutenant Derek John AubreyROE |70818|, Reserve of Air Force Officers,No
    107 Squadron
    Acting Flight Lieutenant Patrick John ShawSHAUGHNESSY |37758|, No
    50 Squadron
    Acting Flight Lieutenant Edward JoughinSHEERAN |39045|, No
    115 Squadron
    Acting Flight Lieutenant John Aloysius SIEBERT|36i55| No
    44 Squadron
    Acting Flight Lieutenant Theophilus John ScottWARREN |41227|, No
    107 Squadron
    Flying Officer Ernest Reginald BAKER |40660|,No
    210 Squadron
    Flying Officer Terence Malcolm BULLOCH|39373|, No
    206 Squadron/Flying Officer Dennis HAYLEY-BELL |39632|,No
    269 Squadron
    Flying Officer Swain MOXHAM |40413|, No
    50Squadron
    Flying Officer William Maxwell NIXON |39677|,No
    10 Squadron
    Flying Officer William Denys ButterworthRUTH |33397|, No
    50 Squadron
    Flying Officer George Edward WALKER |39803|,No
    58 Squadron
    Flying Officer John Douglas George WITHERS|40491|, No
    83 Squadron
    Acting Flying Officer Edward William BITMEAD|76001|, Royal Air Force Volunteer Reserve,N
    o
    115 Squadron
    Acting Flying Officer Charles Thorley KIMBER|42940|, No
    no Squadron
    Pilot Officer Charles Raymond BROWN |83727|,Royal Air Force Volunteer Reserve, No
    101Squadron
    Pilot Officer Ian Morgan Rutherford BROWNLIE|41899|, No
    77 Squadron
    Pilot Officer James Campbell CAIRNS |42692|,No
    10 Squadron
    Pilot Officer Hubert Percival CLARK |43106|,No
    44 Squadron
    Pilot Officer Edward William DEACON |44547|,No
    51 Squadron
    Pilot Officer John Richard DENNY |43954|,No
    78 Squadron
    Pilot Officer George Arthur Litchfield ELLIOTT|43830|, No
    77 Squadron
    Pilot Officer Cecil Victor FRASER |42212|,No
    115 Squadron
    Pilot Officer Robert Stewart GILMOUR |77102|,Royal Air Force Volunteer Reserve, N
    o
    15Squadron
    Pilot Officer Reginald Charles HARBOURNE|44562|, No
    44 Squadron
    Pilot Officer Donald Joseph HARKNESS |41694|,No
    75 |New Zealand| Squadron
    Pilot Officer Walter Ralph HASKELL |44563|,No
    40 Squadron
    Pilot Officer Terance Gerard HYNES |41707|,No
    44 Squadron
    Pilot Officer John LEWIS |40721|, No
    49Squadron
    Pilot Officer James Alexander LITTLE |42009|,N
    o
    83 Squadron
    Pilot Officer Joseph Laurence MOORE |42071|,No
    115 Squadron
    Pilot Officer Edward Donald PARKER, E
    G
    M
    |76465|, Royal Air Force Volunteer Reserve,No
    49 Squadron
    Pilot Officer Victor Fernley Baker PIKE |41868|,N
    o
    58 Squadron
    Pilot Officer Graham Voice RIDPATH |41955|,No
    44 Squadron
    Pilot Officer Donald Neil ROBINSON |40945|,No
    38 Squadron
    Pilot Officer William Michael Ronald SMITH|41216|, No
    44 Squadron
    Pilot Officer Keith James SOMERVILLE |41486|,No
    10 Squadron
    Pilot Officer Peter Wynn Mason WEST |40870|,No
    50 Squadron
    Awarded a Ear to the Distinguished FlyingMedal
    541685 Sergeant Ernest William GIMSON,D
    F
    M
    , No
    107 Squadron
    6700 THE LONDON GAZETTE, 22 NOVEMBER, 1940Awarded the Distinguished Flying Medal
    562801 Flight Sergeant Dennis Henry AllanMOORE, No
    58 Squadron
    562617 Flight Sergeant Christopher ArthurSAUNDERS, No
    61 Squadron
    581236 Sergeant Peter ALDERSON, No
    15Squadron
    525381 Sergeant Alfred George BAKE, No
    50Squadron
    642079 Sergeant William Charles BAKER,No
    15 Squadron
    624302 Sergeant George Robert BIRT, No
    44Squadron
    742085 Sergeant Cyril Denys BOOTHBY, RoyalAir Force Volunteer Reserve, No
    58Squadron
    515878 Sergeant Thomas Henry Joseph CAIRNS,No
    50 Squadron
    522989 Sergeant John DONNELLY, No
    50Squadron
    519433 Sergeant Robert John ELLIS, No
    75|New Zealand| Squadron
    537328 Sergeant Walter ELLIS, No
    49Squadron
    562685 Sergeant Charles Frederick FENNELL,No
    49 Squadron
    580841 Sergeant David GALLANDERS, No
    51Squadron
    552462 Sergeant Leonard Roy GOULD, No
    75|New Zealand| Squadron
    620436 Sergeant Harold GREGSON, No
    51Squadron
    514579 Sergeant Alfred Adair GRIFFIN, No
    51Squadron
    580447 Sergeant Douglas HAIGH, No
    107Squadron
    564681 Sergeant Maxwell Ronald HARVEY,No
    58 Squadron
    580959 Sergeant Douglas Alfred ErnestHAYHURST, No
    83 Squadron
    564688 Sergeant Wilfred Stanley HERRING, No
    44 Squadron
    519884 Sergeant Rowland Noel HOLLAND, No
    18 Squadron
    580451 Sergeant Kenneth Hugh LeslieHOUGHTON, No
    83 Squadron
    581154 Sergeant Kenneth Turner HOWARTH,No
    40 Squadron
    550880 Sergeant Robert Henry HUGHES,No
    75 |New Zealand| Squadron
    633777 Sergeant William Charles BrowneJESSE, No
    58 Squadron
    630868 Sergeant William JESSOP, No
    15Squadron
    565037 Sergeant John LAWSON, No
    233Squadron
    634500 Sergeant Godric LOUGHLIN, -No
    IOTSquadron
    551688 Sergeant Robert McCoRMACK, No
    83Squadron
    619335 Sergeant William McCoRMiCK, No
    50Squadron
    759153 Sergeant Gavin Allan McCREARY,Royal Air Force Volunteer Reserve, No
    40Squadron
    620020 Sergeant Harold MEE, No
    50Squadron
    628050 Sergeant Graham Mackinlay NEESON,No
    51 Squadron
    741545 Sergeant Kenneth NEWTON, Royal AirForce Volunteer Reserve, No
    40 Squadron
    580377 Sergeant George Duff NISBET, No
    83Squadron
    552573 Sergeant William Hugh PARKIN, No,115 Squadron
    5147805 Sergeant Hubert Everard PRESTON,No
    44 Squadron
    621139 Sergeant David ROBERTS, No
    53Squadron
    531759 Sergeant Edward John SAYER, No
    83Squadron'
    546247 Sergeant Cecil SMITH, No
    44 Squadron
    518369 Sergeant Eric Lester SMITH, No
    50Squadron
    631535 Sergeant Harold Walter STONE, No
    58Squadron
    580534 Sergeant Herbert Henry WILLIAMS,No
    82 Squadron
    563740 Sergeant John WOOD, No
    50 Squadron
    580838 Acting Sergeant Ronald CuthbertBESANT, No
    51 Squadron
    747769 Acting Sergeant Gordon COLLIS, RoyalAir Force Volunteer Reserve, No
    101Squadron
    580870 Acting Sergeant Jack Dunthorn LaurieHALL, No
    51 Squadron
    581527 Acting Sergeant Robert Frank PAVELY,No
    15 Squadron
    615871 Leading Aircraftman JamesMcCuTCHEON, No
    102 Squadron
    ROYAL NEW ZEALAND AIR FORCE
    Awarded the Distinguished Flying Cross
    Acting Flight Lieutenant Frederick JohnLUCAS, No
    75 |New Zealand| Squadron
    Flying Officer William Maurice ChalkWILLIAMS, No
    75 |New Zealand| Squadron
    The above awards are for gallantry anddevotion to duty in the execution of airoperations
    India Office,22nd November, 1940
    The KING has been pleased to appointCol
    |tempy
    Brig
    | H
    T
    D
    Hickman,O
    B
    E
    , M
    C
    , to be a District Comdr
    , 2ndNov
    1940
    The KING has approved the followingappointments, etc
    :—INDIAN ARMY


    The above is taken as input into an Excel file and a minute of work - it spits out the following:



    514579 Sergeant Alfred Adair GRIFFIN 5147805 Sergeant Hubert Everard PRESTON
    515878 Sergeant Thomas Henry Joseph CAIRNS
    518369 Sergeant Eric Lester SMITH
    519433 Sergeant Robert John ELLIS
    519884 Sergeant Rowland Noel HOLLAND
    522989 Sergeant John DONNELLY
    525381 Sergeant Alfred George BAKE
    531759 Sergeant Edward John SAYER
    537328 Sergeant Walter ELLIS
    541685 Sergeant Ernest William GIMSON
    546247 Sergeant Cecil SMITH
    550880 Sergeant Robert Henry HUGHES
    551688 Sergeant Robert McCoRMACK
    552462 Sergeant Leonard Roy GOULD
    552573 Sergeant William Hugh PARKIN
    562617 Flight Sergeant Christopher ArthurSAUNDERS
    562685 Sergeant Charles Frederick FENNELL
    562801 Flight Sergeant Dennis Henry AllanMOORE
    563740 Sergeant John WOOD
    564681 Sergeant Maxwell Ronald HARVEY
    564688 Sergeant Wilfred Stanley HERRING
    565037 Sergeant John LAWSON
    580377 Sergeant George Duff NISBET
    580447 Sergeant Douglas HAIGH
    580451 Sergeant Kenneth Hugh LeslieHOUGHTON
    580534 Sergeant Herbert Henry WILLIAMS
    580838 Acting Sergeant Ronald CuthbertBESANT
    580841 Sergeant David GALLANDERS
    580870 Acting Sergeant Jack Dunthorn LaurieHALL
    580959 Sergeant Douglas Alfred ErnestHAYHURST
    581154 Sergeant Kenneth Turner HOWARTH
    581236 Sergeant Peter ALDERSON
    581527 Acting Sergeant Robert Frank PAVELY
    615871 Leading Aircraftman JamesMcCuTCHEON
    619335 Sergeant William McCoRMiCK
    620020 Sergeant Harold MEE
    620436 Sergeant Harold GREGSON
    621139 Sergeant David ROBERTS
    624302 Sergeant George Robert BIRT
    628050 Sergeant Graham Mackinlay NEESON
    630868 Sergeant William JESSOP
    631535 Sergeant Harold Walter STONE
    633777 Sergeant William Charles BrowneJESSE
    634500 Sergeant Godric LOUGHLIN
    642079 Sergeant William Charles BAKER
    741545 Sergeant Kenneth NEWTON
    742085 Sergeant Cyril Denys BOOTHBY
    747769 Acting Sergeant Gordon COLLIS
    759153 Sergeant Gavin Allan McCREARY
    Acting Flight Lieutenant Cecil Dunn MILNE 33273
    Acting Flight Lieutenant Charles Brian BEST 41366
    Acting Flight Lieutenant Clair Mansell MayburyGRECE 36168
    Acting Flight Lieutenant David StewartROBERTSON 40141
    Acting Flight Lieutenant Derek John AubreyROE 70818
    Acting Flight Lieutenant Edward JoughinSHEERAN 39045
    Acting Flight Lieutenant Frederick JohnLUCAS, No
    Acting Flight Lieutenant John Aloysius SIEBERT 36i55
    Acting Flight Lieutenant John Colpoys BREESE 33303
    Acting Flight Lieutenant John WilliamSTEPHENS, D
    Acting Flight Lieutenant Patrick John ShawSHAUGHNESSY 37758
    Acting Flight Lieutenant Theophilus John ScottWARREN 41227
    Acting Flight Lieutenant Thomas MichaelHORGAN 40114
    Acting Flying Officer Charles Thorley KIMBER 42940
    Acting Flying Officer Edward William BITMEAD 76001
    Acting Squadron Leader Douglas PARKER 37009
    Acting Squadron Leader John Stuart BARTLETT[ 3437
    Acting Squadron Leader Richard KempWILDEY 37437
    Acting Wing Commander John NicholasHaworth WHITWORTH, D
    Acting squadron Leader John Noel MAHLER 37452
    Flight Lieutenant Louis Douglas WILSON 33295
    Acting Flight Lieutenant Richard SydneyEdward ALDRIDGE 70836
    Flight Lieutenant Roderick HugK McCoNNELL 33268
    Flying Officer Ernest Reginald BAKER 40660
    Flying Officer George Edward WALKER 39803
    Flying Officer John Douglas George WITHERS 40491
    Flying Officer Swain MOXHAM 40413
    Flying Officer Terence Malcolm BULLOCH 39373
    Flying Officer William Denys ButterworthRUTH 33397
    Flying Officer William Maurice ChalkWILLIAMS, No
    Flying Officer William Maxwell NIXON 39677
    Pilot Officer Cecil Victor FRASER 42212
    Pilot Officer Charles Raymond BROWN 83727
    Pilot Officer Donald Joseph HARKNESS 41694
    Pilot Officer Donald Neil ROBINSON 40945
    Pilot Officer Edward Donald PARKER, E
    Pilot Officer Edward William DEACON 44547
    Pilot Officer George Arthur Litchfield ELLIOTT 43830
    Pilot Officer Graham Voice RIDPATH 41955
    Pilot Officer Hubert Percival CLARK 43106
    Pilot Officer Ian Morgan Rutherford BROWNLIE 41899
    Pilot Officer James Alexander LITTLE 42009
    Pilot Officer James Campbell CAIRNS 42692
    Pilot Officer John LEWIS 40721
    Pilot Officer John Richard DENNY 43954
    Pilot Officer Joseph Laurence MOORE 42071
    Pilot Officer Keith James SOMERVILLE 41486
    Pilot Officer Peter Wynn Mason WEST 40870
    Pilot Officer Reginald Charles HARBOURNE 44562
    Pilot Officer Robert Stewart GILMOUR 77102
    Pilot Officer Terance Gerard HYNES 41707
    Pilot Officer Victor Fernley Baker PIKE 41868
    Pilot Officer Walter Ralph HASKELL 44563
    Pilot Officer William Michael Ronald SMITH 41216
    Squadron Leader Hugh Lockhart MAXWELL^25046 , No
    Squadron Leader James Ellis McCoMB 90352
    Squadron Leader Stanley Sinclair BERTRAM 34078
    Squadron Leader Wynne Somers GoodrichMAYDWELL 25075

    This is where the volunteers come in again - fix the minor issues like removing the "^".. or inserting spaces in names where they are missing.


    The final output can easily go into a database - along with the reference to the original LG Issue that the information came from.

  7. #27
    Join Date
    Nov 2007
    Posts
    944
    Thanks
    0
    Thanked 1 Time in 1 Post

    Default

    Looks fairly simple work, unless the one has to frequently verify the entries against the LG.
    Would it be possible to extract and add the reason of mention, ie. promotion or award? If it could be done, then the resulting database would be much superior to a simple database with names and serial number.
    https://www.facebook.com/Franciszek-Grabowski-241360809684411/

  8. #28
    Join Date
    Jan 2008
    Location
    Delaware, USA
    Posts
    920
    Thanks
    173
    Thanked 16 Times in 11 Posts

    Default

    Here is a first raw cut for the index.

    This has about 370,000 entries and 140,000 Unique service numbers (Officers and WOs) of RAF.

    Does not have data from Canadian, Auz, NZ or other commonwealth arms.

    As Ross mentioned - the data requires lot of effort to clean - So I did not attempt it beyond a couple of hours. - but atleast the service numbers are accurate


    http://www.rafcommands.com/database/lgindex/

    Give it a shot and let me know how your searches worked.

    when the index is developed a bit - it can link directly to the issue on the LG site..

  9. #29
    Join Date
    Nov 2007
    Posts
    944
    Thanks
    0
    Thanked 1 Time in 1 Post

    Default

    I think it is important to discuss all the possible issues and problems now, before the project is run, as any later changes might be a bit problematic.
    I believe a date of issue of LG would be very useful, and I think it shall not be a problem to add it automatically.
    Then we have entries like this.
    EQUIPMENT BRANCH Retirement Wg Cdr S D DENNIS
    Followed by say.
    J R GARDINER
    Of course the entry on branch and action might be removed, but I find it most useful, as it allows to immediately asses the information.
    So it would be.
    EQUIPMENT BRANCH,Retirement,Wg Cdr,S D,DENNIS
    ,,,J R, GARDINER
    If the order is kept, then it shall be possible to fairly easy fill blanks with branch and action.
    I mean that if the effort is to be put on extracting info from LG, then it would be worth to get maximum information at once.
    I would also consider a possible integration of the data with those extracted from other databases like RAF lists or AIR 78 (Do not curse me!). :)
    https://www.facebook.com/Franciszek-Grabowski-241360809684411/

  10. #30
    Join Date
    Nov 2007
    Location
    Reading, Berkshire, UK
    Posts
    3,605
    Thanks
    3
    Thanked 12 Times in 12 Posts

    Default

    Hello All,

    I agree most emphatically with Franek!! We must decide the Rules, and Format, of the End Product now – before we all start beavering away.

    I have, for more than a decade, been steadily manually converting the LG entries for the Meteorological Branch (+ lot of other info) into Excel. I currently have a s/sheet of some 7000 Rows and 11 Cols. It was/is very labour intensive – but I’m retired. I’m only moderately computer-savvy since I retired (20-odd yrs ago!).

    I did the same thing for the first 100 entries that were given. It is a very labour intensive exercise. The first s/sheet had 3 Cols. To make ALL that data fully searchable I needed 9 Cols. They were:
    A OR No – where the LG entry is a Commissioning, if it is from the OR then that OR No is a valuable research tool.
    B Off No – As the original A.
    C SURNAME – always in Capitals (there is, no doubt, an Excel trick for converting ‘text’ in either Col or Row to UPPER?)
    D Forename(s)/Init(s) – Got to be a mixture, but must gravitate towards Forenames over time.
    E Occ(urrence) – Prom(otion), Award, Rel(inquish), Ret(irement), Forf(eture) Cmd(Commissioned), Term(inated), Loc(ated), etc, etc. There needs to be a Primary List which everybody uses. Individuals can add Occurrences to their own database as their research dictates.
    F From – These are mainly ranks (shortened forms must be agreed and laid down). There are a number of other titles/positions – which should be agreed.
    G To/As – Same as F.
    H BRANCH – Very useful for tracking individuals, or Branches!, over time
    I AWARD – Post-nominals – a bit of a mine-field on its own!!
    J LG # - As the original C

    The original Col B will have disappeared – its contents now being distributed/displayed in the new Cols.

    I have (in my Met s/sheets) some DATE cols. It is vital that we all use the same (or interchangeable) format – either dd-mmm-yyyy, or yyyymmdd (or both?). Whatever is agreed has to be useable both by the experts – and by newcomers (who may only have minimal computer ability). Very often the ‘source’ only knows the month/year of an occurrence (AFLs, etc). In this instance I use a convention when entering the date – 15-mmm-yy (or yyyy) is entered in italic. So I can’t be more than 15 days in error!!!

    I have a Col for SERVICE. RNAS, RFC, RAF, RAFVR, RAFO, RAFR, AAF, etc, etc, and all the other organisations my blokes(and blokesses) came from and/or went to.

    And while we are talking about Service Numbers, can some bright spark work out how to cope with those numbers that have either Ident Numbers/Letters, and/or Check Letters, at front and/or back?


    I also make considerable use of the Excel ‘Comment’ facility. This should be retained.

    There, I’ve had my say. But if you want a vast army of Transcribers/Approvers then The Rules must be fire-proof! Like A=Yes, B=No, and C=refer upwards (but to where/who?).

    It will be difficult to get all interested in the same place/time. A big Post with, say, 6 examples of the END PRODUCT could be posted, and we all vote for our favourite?

    I hope this has been of help. I look forward some advances! I would like to help!

    Rgds

    Peter Davies
    Last edited by Resmoroh; 1st April 2018 at 14:47. Reason: QSD
    Meteorology is a science; good meteorology is an art!
    We might not know - but we might know who does!

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •