ROADMAP will simplify this process in the next version - using a reporting web page


  1. Login to the server
  2. Place yourself at the working directory - i.e. /opt/caraone_data1 for example
  3. Run this command to make reporting where all the tickets are much simpler:
    find . -maxdepth 1 -type d -exec sh -c 'echo -n "{} : "; find "{}" -type f | wc -l' \; | sort -n
  4. Consider this command running as a job that updates every "X" seconds/minutes.  Don't forget to be at the /opt/caraone_data1 level. It will output the numbers in each of the jobs_analyze_media and jobs_done directories for easier reading:

    while systemctl is-active --quiet runner; do     echo " "
        echo "                      <<<<<<<<<< Jobs_done ($(date '+%Y-%m-%d %H:%M:%S')) >>>>>>>>>>";     echo "video_: $(find jobs_done/ -maxdepth 1 -name 'video_*' | wc -l), picture_: $(find jobs_done/ -maxdepth 1 -name 'picture_*' | wc -l), pdf_: $(find jobs_done/ -maxdepth 1 -name 'pdf_*' | wc -l), txt_: $(find jobs_done/ -maxdepth 1 -name 'txt_*' | wc -l), avb_: $(find jobs_done/ -maxdepth 1 -name 'avb_*' | wc -l), ppt_: $(find jobs_done/ -maxdepth 1 -name 'ppt_*' | wc -l), doc_: $(find jobs_done/ -maxdepth 1 -name 'doc_*' | wc -l)" 
        echo " "
        echo "                      <<<<<<<<<< Jobs_analyze_media ($(date '+%Y-%m-%d %H:%M:%S')) >>>>>>>>>>";     echo "video_: $(find jobs_analyze_media/ -maxdepth 1 -name 'video_*' | wc -l), picture_: $(find jobs_analyze_media/ -maxdepth 1 -name 'picture_*' | wc -l), pdf_: $(find jobs_analyze_media/ -maxdepth 1 -name 'pdf_*' | wc -l), txt_: $(find jobs_analyze_media/ -maxdepth 1 -name 'txt_*' | wc -l), avb_: $(find jobs_analyze_media/ -maxdepth 1 -name 'avb_*' | wc -l), ppt_: $(find jobs_analyze_media/ -maxdepth 1 -name 'ppt_*' | wc -l), doc_: $(find jobs_analyze_media/ -maxdepth 1 -name 'doc_*' | wc -l)"
         sleep 600  # Wait for 10 minutes (600 seconds)
    done

  5. Place yourself inside the jobs_done or Jobs_analyze_media folders and run this command for a break down of the numbers of each type of files finished or in the queue.

    echo "video_: $(ls -1 video_* | wc -l), picture_: $(ls -1 picture_* | wc -l), pdf_: $(ls -1 pdf_* | wc -l), txt_: $(ls -1 txt_* | wc -l), avb_: $(ls -1 avb_* | wc -l), ppt_: $(ls -1 ppt_* | wc -l), doc_: $(ls -1 doc_* | wc -l)"


  6. You can also tail -f each log of interest.  Most helpful in the indexing process is the fswatch.log and the processingdriver.log
  7. Next check the processingdriver.log - look for tickets in progress and active processes, av queue, and doc queue 


2024-05-30 05:42:04 INFO tickets_in_progress: 1

2024-05-30 05:42:04 INFO Active processes: ['get_video_embeddings', 'send_audio']

2024-05-30 05:42:03 INFO    doc_queue: 7


5. Try checking the database files for anything that happened on a certain date - /opt/caraone_data/log$ cat */database.log | grep "2024-08-19"

6. At the file system level - run a "ls jobs_*" Look for analyze media for existing files still pending processing, and jobs_done for finished jobs


root@AInode-4:/opt/caraone_user1# ls jobs_*

jobs_analyze_media:

pdf_4durzi4ulq6fgiuwyeqt0lr5n  picture_27vo4tgcksafs  picture_32quvnftz9wuy          video_1n2yx7p3lc00   video_3qg3hucqajdvk

pdf_4egnt03okh81irrk6ozo8e47w  picture_288h7llexq5aj  picture_32wuh7fkemx6y          video_1qwr1vmr7zdhe  video_3qntwfuvh6rr4

pdf_56sv1acc4u3t2iwbmcm3ujyw7  picture_28shu1cdjqjjh  picture_34p80i0vhe48h          video_1rgxcuq4zumf4  video_3sn0carpa6rk0

pdf_7nzrvsu76n6bwl8533raiy3sc  picture_2c38n785m8ktj  picture_37gr7d62m5lxc          video_2612ovzusfrb4  video_3stg7pjwjflz4

pdf_833jkm6noa8j575wtwgvzad25  picture_2cmzet8vt630t  picture_3axbx0gytfkmg          video_261l81z8i7yww  video_3stu61x2glji8

pdf_9iva8le3qu2mko1q0wazgwbhp  picture_2f8u4uxh1uu4p  picture_3crqulzi3ujt5          video_26wskvteyucg   video_3vbsali0btf40

pdf_adto6o1s5mq5i3hidsfdzax30  picture_2mi25una1m32m  picture_3d9vh442kmvsd          video_3cwthd9djfbpc  video_3vgog4ykt2k8w

pdf_bqfnv9y1k3dby0x9l0hzfk29r  picture_2pac6dav9o87o  picture_3t4p5syn49lnm          video_3e48l8ajk9awx  video_3vm5h314scu0w

pdf_zqs7jz391o4ywz4yhaygf45b   picture_2qpgyjnk5msok  picture_3uxxryluvzsxj          video_3f1j032n81vjs  video_3w59b3mujruuw

picture_1z3zipy2vudad          picture_2t4augimglb1v  txt_21s3cjpmpa1hh7ny1hd1ge92   video_3f3yy7otw5ldu  video_3w5dx56mmhakg

picture_21f0ek7kegkf9          picture_2zksfbsokiptv  txt_5uuf8l6iu0szvya4gehdttyu7  video_3f3z6rwxc9wsf  video_9ptdpxflmagw

picture_21otqlf7a3ip3          picture_2zq472fwc1tqi  txt_6p6sfa7oxs1grb8cbd8m0at0x  video_3f41d1d4i5ou8  video_h4qe0d1gwadc

picture_228s51yrkrcud          picture_2zrkherg372g6  txt_a0xzuah0rz0keudthp2r5ihhu  video_3f4pz0niklfr4  video_l6tpibqbk

picture_23pteaiugrd4m          picture_303833z1pet55  txt_ez94na7acqqmkjcrg5bhh8rfs  video_3k213xraahq0w  video_wtz31nv7fh80

picture_24gp2pdlo31ad          picture_305wd4r5bcnv4  video_160pbzf8xmtc             video_3ne3nd4g2i83k  video_wuvll0o722kg

picture_27krxw7ytxnye          picture_308j42cbb7mp1  video_1dtg3h3lg5ons            video_3nfcoxshqxscr  video_yjjkz3o3rtog


jobs_delete:


jobs_done:

pdf_2pt6pk85wzwtyok6zmhxjrmee  pdf_a3xg10w0e4nblmxoo0hacsww5  video_13c7mcds41kk   video_3vo2frwi4mrc0  video_3w5b9jp6gaeim  video_4ng2ta65e8g0

pdf_48vip1rf9kl445tneu8rozcsy  pdf_af6vir5tsae5uohk9ei5qe2w5  video_1cdcqpvgmpygw  video_3vxgkn5ugcyhd  video_3w5b9kcd6o1ks  video_97ay3mxz2kg

pdf_53i0gbh8sz0iuby9r67lctq10  pdf_amvmk6beihz9xv8d34t7zz0cu  video_1xo6nxjjlbhsw  video_3w0dw2m26tlh8  video_3w5b9n7gje7e8  video_fb59c450alfk

pdf_59b8g66loshmystadaimyvx0j  pdf_av8amv91vn0gwgr9q411l4d0w  video_1xss6j2fovhmo  video_3w4rrosn4e2gw  video_3w5b9ydynf9xa  video_fmxu8ctzymzo

pdf_5sbihzauhib2893kar9f7y93m  pdf_awv78z1peug88utk0keba7q4m  video_2a6o31mobz8qq  video_3w5b998bo8kxr  video_3w5bbau648m4c  video_h7fl60yavm8

pdf_6zykfiorteytgucyx1e4iwu4o  pdf_cxwqb2drxj4pc106cp0wtu143  video_2x3yrdxx04z6o  video_3w5b99c9qgc8v  video_3w5bizxy03dc0  video_jovk53hwln9c

pdf_7b5gao0u1t1ng75bofbngsv68  pdf_mi3k3f16x9648vsca37b0ps3   video_3ejbp5e6wbpy7  video_3w5b99e8rkqv0  video_3w5byl4gg8ovw  video_q9xlyth59hir

pdf_7d2o2y30e3tlgq7u8nvwpnrqg  pdf_x251bkch0sa1ceu0031ex0ga   video_3k29hzb1ax6v4  video_3w5b99m4us072  video_3w5du63h4jj7k  video_s14gvbrqroqo

pdf_7sn64w8mmdkjum9dwcfskync0  pdf_y9e7ape01mpd8nfjiczr5blm   video_3ndz52y1p48am  video_3w5b99q2vr6da  video_3w5e0os2rs2nz  video_ve5id72s0hs

pdf_8769kqbheygfzxypmatpra1k1  video_112x5v9ku0236            video_3pzqu3d0c42ki  video_3w5b9comm987k  video_3w5e104hbs0e8

pdf_9nho42vr2sat7nmttyvynicwe  video_132p3vr1x1mao            video_3v0tfv50tnge8  video_3w5b9jp6bsovy  video_4cigvvwyafk0


jobs_error:


jobs_proxy:

root@AInode-4:/opt/caraone_user1#


Jobs proxy breakdown and jobs done and jobs analyse: 


clear; while systemctl is-active --quiet runner; do  echo "---------------------------";     echo; echo;
        echo "<<<<<<<<<< jobs_analyze_media ($(date '+%Y-%m-%d %H:%M:%S')) >>>>>>>>>>";         echo "video:    $(find jobs_analyze_media/ -maxdepth 1 -type f -name 'video_*'          | wc -l)";      echo "picture:  $(find jobs_analyze_media/ -maxdepth 1 -type f -name 'picture_*'        | wc -l)";      echo "pdf:      $(find jobs_analyze_media/ -maxdepth 1 -type f -name 'pdf_*'            | wc -l)";      echo "txt:      $(find jobs_analyze_media/ -maxdepth 1 -type f -name 'txt_*'            | wc -l)";      echo "avb:      $(find jobs_analyze_media/ -maxdepth 1 -type f -name 'avb_*'            | wc -l)";      echo "ppt:      $(find jobs_analyze_media/ -maxdepth 1 -type f -name 'ppt_*'            | wc -l)";      echo "doc:      $(find jobs_analyze_media/ -maxdepth 1 -type f -name 'doc_*'            | wc -l)";      echo;   echo;     
        echo "<<<<<<<<<< jobs_done ($(date '+%Y-%m-%d %H:%M:%S')) >>>>>>>>>>";  echo "video:    $(find jobs_done/ -maxdepth 1 -type f -name 'video_*'           | wc -l)";      echo "picture:  $(find jobs_done/ -maxdepth 1 -type f -name 'picture_*' | wc -l)";      echo "pdf:      $(find jobs_done/ -maxdepth 1 -type f -name 'pdf_*'     | wc -l)";      echo "txt:      $(find jobs_done/ -maxdepth 1 -type f -name 'txt_*'     | wc -l)";      echo "avb:      $(find jobs_done/ -maxdepth 1 -type f -name 'avb_*'     | wc -l)";      echo "ppt:      $(find jobs_done/ -maxdepth 1 -type f -name 'ppt_*'     | wc -l)";      echo "doc:      $(find jobs_done/ -maxdepth 1 -type f -name 'doc_*'     | wc -l)";      echo;   echo;   
echo "<<<<<<<<<< jobs_proxy ($(date '+%Y-%m-%d %H:%M:%S')) >>>>>>>>>>";         echo "Tickets: $(find jobs_proxy/ -maxdepth 1 -type f  | wc -l)";       echo;           sleep 600; done