On Thu, Mar 20, 2025 at 4:17 PM Trip Tucker <triptuckertrek(a)gmail.com>
wrote:
TOPIC: dsub & Docker to batch Google Cloud Platform (GCP)
Presenter: Robert Citek (https://www.sluug.org/bio/Robert_Citek)
*From Months to Hours: Batch Processing with Docker & dsub on Google Cloud Platform*
This presentation will showcase how dsub, a powerful command-line
tool leveraging Docker and Google Cloud Platform (GCP), enables highly
parallel and cost-effective execution of batch processing jobs in the
cloud. We will explore a real-world case study involving the
conversion of 10,000 images from a proprietary format to TIFF. This
project, which would have been prohibitively time-consuming and expensive
on a single system or even a small in-house cluster, was completed in hours
for a small fraction of the cost using dsub on GCP. The talk will
provide an overview of the project's challenges and how dsub's capabilities
in job submission, resource management, and parallelization were utilized
to achieve significant time and cost savings. Following the project
overview, a live demonstration of a small-scale dsub workflow on GCP will
illustrate the tool's ease of use and potential for accelerating various
computational tasks. Here are the links:
https://github.com/DataBiosphere/dsub
https://hub.docker.com/r/rwcitek/dsub/tags
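
As a rough illustration of how a job like the 10,000-image conversion
can be fanned out: dsub accepts a tab-separated tasks file (its --tasks
flag) whose header row names the parameters and whose remaining rows each
describe one task. The Python sketch below generates such a file. The
bucket name, directory layout, ".dat" extension, and variable names are
all hypothetical; the announcement does not say how the actual project
was organized.

```python
import csv
import io

# Hypothetical values: the announcement gives no bucket or file names.
BUCKET = "gs://example-bucket"
NUM_IMAGES = 10_000


def build_tasks_tsv(num_images: int, bucket: str) -> str:
    """Return the text of a dsub tasks file: one header row, one row per task."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    # Header columns follow dsub's "--input NAME" / "--output NAME" convention.
    writer.writerow(["--input INPUT_IMAGE", "--output OUTPUT_TIFF"])
    for i in range(1, num_images + 1):
        name = f"image_{i:05d}"
        writer.writerow(
            [f"{bucket}/raw/{name}.dat", f"{bucket}/tiff/{name}.tiff"]
        )
    return buf.getvalue()


tasks_tsv = build_tasks_tsv(NUM_IMAGES, BUCKET)
lines = tasks_tsv.splitlines()
print(lines[0])
print(len(lines) - 1, "tasks")
```

Each data row becomes an independent task, so the cloud backend can run
many conversions at once rather than one after another.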
========================================= From GENERIC DESCRIPTIONS:
dsub currently supports the Cloud Life Sciences v2beta API
(https://cloud.google.com/life-sciences/docs/reference/rest) from Google
Cloud and is developing support for the Batch API
(https://cloud.google.com/batch/docs/reference/rest) from Google Cloud.

dsub: simple batch jobs with Docker
(https://github.com/DataBiosphere/dsub#dsub-simple-batch-jobs-with-docker)

License (https://github.com/DataBiosphere/dsub/blob/main/LICENSE)

Overview (https://github.com/DataBiosphere/dsub#overview)

dsub is
a command-line tool that makes it easy to submit and run batch scripts in
the cloud. The dsub user experience is modeled after traditional
high-performance computing job schedulers like Grid Engine and Slurm. You
write a script and then submit it to a job scheduler from a shell prompt on
your local machine. Today dsub supports Google Cloud as the backend batch
job runner, along with a local provider for development and testing. With
help from the community, we'd like to add other backends, such as Grid
Engine, Slurm, Amazon Batch, and Azure Batch.
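
To make the "write a script, then submit it" workflow described above
concrete, here is a minimal Python sketch that assembles a dsub command
line without running it. The flags used (--provider, --project,
--regions, --logging, --image, --script, --tasks, --wait) are standard
dsub options, and "google-cls-v2" is assumed here as the provider name
for the Cloud Life Sciences v2beta API mentioned above. The project ID,
bucket, region, image, and file names are placeholders, not values from
the talk.

```python
import shlex


def build_dsub_command(project, logging_path, image, script, tasks_file,
                       regions="us-central1"):
    """Assemble (but do not run) a dsub submission as an argument list."""
    return [
        "dsub",
        "--provider", "google-cls-v2",  # assumed: Cloud Life Sciences v2beta
        "--project", project,
        "--regions", regions,
        "--logging", logging_path,      # where dsub writes per-task logs
        "--image", image,               # Docker image each task runs in
        "--script", script,             # the batch script you wrote
        "--tasks", tasks_file,          # TSV file, one row per task
        "--wait",                       # block until all tasks finish
    ]


# All values below are hypothetical placeholders.
command = build_dsub_command(
    project="my-gcp-project",
    logging_path="gs://example-bucket/logs",
    image="ubuntu:22.04",
    script="convert.sh",
    tasks_file="tasks.tsv",
)
print(shlex.join(command))
```

Printing the command first, rather than executing it, makes it easy to
review before anything is billed; passing the list to subprocess.run
would submit it on a machine where dsub is installed and authenticated.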
======================================
On Thu, Mar 20, 2025 at 9:12 AM SLUUG Announcement List <
announce(a)sluug.org> wrote:
St. Louis Linux Users Group
(STLLINUX)
www.stllinux.org
__20_March_2025__
6:30 ~ 9:00 PM ( Central Daylight Time, USA )
**We will open the remote session at about 6:00 PM**
**Join early to test your microphone, screen and video sharing**
**** Microphone Or Webcam Are Not Required To Join ****
TOPIC: dsub on Google Cloud Platform by Robert Citek
Join Zoom Meeting
https://us06web.zoom.us/j/83011092509?pwd=bxo2mTwESAK7oGsF2WFr8o6T0MbNUU.1
The Saint Louis Linux Users Group (STLLINUX) is a non-profit, volunteer
group that provides education, information and support for Linux users.
You are invited to attend this next ONLINE session, in lieu of any
regular physical face-to-face meeting. Instructions are below.
* ONLINE Sessions
* NO PHYSICAL MEETINGS until further notice.
* ONLINE session will use ZOOM remote video service 20 March 2025.
--- invite start ---
Omnitec Corporation is sponsoring this scheduled ZOOM meeting.
www.omnitec.net
Topic: Saint Louis Linux Users Group (STLLINUX)
Time: Mar 20, 2025 18:30 Central Time (US and Canada)
Join Zoom Meeting
https://us06web.zoom.us/j/83011092509?pwd=bxo2mTwESAK7oGsF2WFr8o6T0MbNUU.1
Meeting ID: 830 1109 2509
Passcode: 698598
---
One tap mobile
+13052241968,,83011092509# US
+13092053325,,83011092509# US
---
Dial by your location
• +1 305 224 1968 US
• +1 309 205 3325 US
• +1 312 626 6799 US (Chicago)
• +1 646 558 8656 US (New York)
• +1 646 931 3860 US
• +1 301 715 8592 US (Washington DC)
• +1 669 444 9171 US
• +1 689 278 1000 US
• +1 719 359 4580 US
• +1 720 707 2699 US (Denver)
• +1 253 205 0468 US
• +1 253 215 8782 US (Tacoma)
• +1 346 248 7799 US (Houston)
• +1 360 209 5623 US
• +1 386 347 5053 US
• +1 507 473 4847 US
• +1 564 217 2000 US
Meeting ID: 830 1109 2509
Find your local number:
https://us06web.zoom.us/u/kd9kIcXK5C
--- invite end ---
-- End-Of-File --