From 8e401da61ae2fd69d5b84f6a0133ba1f320d49f3 Mon Sep 17 00:00:00 2001 From: kdd-ubc Date: Mon, 22 Jul 2024 16:47:35 -0700 Subject: [PATCH] update --- .../index/execute-results/html.json | 4 ++-- about.qmd | 8 +------- 2 files changed, 3 insertions(+), 9 deletions(-) diff --git a/_freeze/posts/ribosome-tunnel-extraction/index/execute-results/html.json b/_freeze/posts/ribosome-tunnel-extraction/index/execute-results/html.json index 31bead4..de978f7 100644 --- a/_freeze/posts/ribosome-tunnel-extraction/index/execute-results/html.json +++ b/_freeze/posts/ribosome-tunnel-extraction/index/execute-results/html.json @@ -1,8 +1,8 @@ { - "hash": "7d177d99249553d6a737e22d5a967814", + "hash": "11cc7b1e1d8d244a8688d3bc3aa7334e", "result": { "engine": "jupyter", - "markdown": "---\ntitle: \"3D tessellation of biomolecular cavity\"\nsubtitle: \"Protocol for analyzing the ribosome exit tunnel\"\nbibliography: \"references.bib\"\ncsl: nature.csl\nengine: \"jupyter\"\nauthor:\n - name: \"Artem Kushner\" \n email: \"rtkushner@gmail.com\"\n affiliations:\n - name: KDD Group\n url: \"https://rtviii.xyz/\"\n\n - name: \"Khanh Dao Duc\" \n email: \"kdd@math.ubc.ca\"\n affiliations:\n - name: Department of Mathematics, UBC\n url: \"https://www.math.ubc.ca/\"\n - name: Department of Computer Science, UBC\n url: \"https://www.cs.ubc.ca/\"\n\ndate: \"29 June 2024\"\ncategories: [biology, bioinformatics, surface-reconstruction, computer graphics] \n\ncallout-icon: false\n# format:\n# pdf:\n# include-in-header:\n# text: |\n# \\usepackage{amsmath}\n\nexecute:\n echo: false\n freeze: auto\n pip: [\"pyvista\", \"open3d\", \"scikit-learn\", \"mendeleev\", \"compas\", \"matplotlib\"]\n\n---\n\n\n\n\n\n\n\n\n\n\n## Summary and Background\n\nWe present a protocol to extract the surface of a biomolecular cavity for shape analysis and molecular simulations.\n\nWe apply and illustrate the protocol on the ribosome structure, which contains a subcompartment known as the ribosome exit tunnel. More details on the tunnel features and biological importance can be found in our previous works [@dao2018impact; @dao2019differences]. The protocol was also design to refine the output obtained from MOLE software [@sehnal2013mole] (see Figure XXX)\n\n\n\n\n\n::: {#fig-intro layout=\"[1,1]\" layout-valign=\"bottom\"}\n\n![Structure of the human ribosome (PDB ID: 8OJ0).](./data/8OJ0.gif){#fig-ribo}\n\n![The ribosome exit tunnel (figure from Dao Duc et al. [@dao2019differences])](./data/ptc_and_tunnel_illustration.png){#fig-ribo2}\n\nThe ribosome and the exit tunnel extracted using Mole software [@sehnal2013mole]\n\n:::\n\n\n\n\n## Visual Protocol#\n\n\n![Schematic representation of the tunnel geometry surface reconstruction ](./data/visual_protocol.png){fig-alt=\"\"}\n\n\n\n\n## 1. Bounding Box \n\nThis step captures the subset of atoms enclosing the cavity of interest (the tunnel) from the original structure.\n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## Parameters\n\n$R_{\\mathbf{expansion}}$: define the radius of expansion to be the value added to $R_{x,y,z}$ of the centerline at every $C_{x,y,z}$\n\n$B$: The smallest bounding box containing all of the coordinates formed by the expanded centerline \n\n$R_{\\mathbf{vdw}}$: Van der Waals radius of an atom in Ã…ngstroms. Default to the value of 2.\n\n$pc_{B}$: The pointcloud formed by applying $B$ to the original structure and widening each atom coordinate to include points withing a sphere with radius $R_{\\mathbf{vdw}}$\n\n:::\n\n1. For each $C_{x,y,z}$, capture atoms whose center belongs to the sphere formed by $R_{x,y,z} + R_{\\mathbf{expansion}}$, filter out duplicates. Call this coordinate set the ${\\textit{Centerline Expansion Atoms}}$.\n\n2. Calculate the coordinates of the bounding box $B$ around the $\\text{\\textit{Centerline Expansion Atoms}}$.\n\n3. Apply $B$ to the initial structure to extract all atoms that belong to $B$. Call this $\\mathbf{pc^{B}}$.\n\n4. Widen each coordinate (atom center) $C_{x,y,z}^{pc_{B}}$ inside $pc_{B}$ by $R_{vdw}$ to include a more realistic representation of atoms. This is done by creating a voxel subgrid whose bounding cube is between $(C_{x}^{pc_{B}}-R_{vdw},C_{y}^{pc_{B}}-R_{vdw},C_{z}^{pc_{B}}-R_{vdw})$ and $(C_{x}^{pc_{B}}+R_{vdw},C_{y}^{pc_{B}}+R_{vdw},C_{z}^{pc_{B}}+R_{vdw})$, a cube of indices centered at $C_{x,y,z}^{pc_{B}}$. The resultant coordinate set is $\\mathbf{pc_{B}}$.\n\n5. Anchor the coordinates of the $pc_{B}$ to the origin by subtracting the $\\mu(pc_{B})$ of the coordinate set from each $C_{x,y,z}^{pc_{B}}$ and then shifting each $C_{x,y,z}^{pc_{B}}$ upwards by $|\\min(x,y,z)|$. This is done to reduce the amount of empty voxel cells in the following steps, reduce compute. \n\n## 2. Voxelization\n\n\n1. Assume voxel size of $1$ in correspondence to the units of the dataset, Angstroms in our case. (Alternatively, atom-to-sphere expansion in step **1. Bounding Box** should be accordingly scaled). \n\n2. Create a boolean voxel grid with the dimensions of the ($pc_{B}$ + $1$), call this the $Grid_{index}$ (as opposed to $Grid_{coordinate}$)\n\n3. Set voxels at _index_ [$C_{x},C_{y},C_{z}$] for every $C$ in $pc_{B}$ in the $Grid_{index}$ to $1$. All other voxel are $0$.\n\n\n## 3. Inversion\n\nInvert the $Grid_{index}$ to create a representation of the _\"empty space\"_ inside the exit tunnel. \n\n## 4. DBSCAN\n\nThe aim in this step is to extract only the voxels belonging to the \"empty space\" inside the tunnel and no other. Given that we have a good idea of the Van der Waals radii of the atoms that constitute the walls and have control over the size of the voxel in the $Grid_{index}$, one method that we can apply is DBSCAN. \n\nDBSCAN is a density-based clustering non-parametric algorithm that is akin to UMAP/t-SNE. \n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## DBSCAN Parameters\n\n$eps$: The maximum distance between two samples for one to be considered as in the neighborhood of the other. This is the most important DBSCAN parameter to choose appropriately for your data set and distance function.\n\n$min\\_nbrs$ ($min\\_samples$): If $min\\_samples$ is set to a higher value, DBSCAN will find denser clusters, whereas if it is set to a lower value, the found clusters will be more sparse. The metric to use when calculating distance between instances in a feature array.\n\n$metric$: The metric to use when calculating distance between instances in a feature array. We use the Euclidian distance.\n:::\n\n::: {#2ed5fc86 .cell execution_count=1}\n\n::: {.cell-output .cell-output-display}\n![](index_files/figure-html/cell-2-output-1.png){}\n:::\n:::\n\n\n## 5. Interior Surface via Delaunay Triangulation\n\nThe aim of this step is to extract a point cloud containing only the voxels on the surface of convex hull enclosing the interior space of the tunnel.\n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## Delaunay 3D parameters\n\n\n$alpha$: Controls the smoothness of the constructed alpha-shape. For a non-zero $alpha$ value, only vertices, edges, faces, or tetrahedra contained within the circumsphere (of radius $alpha$) will be output. Otherwise, only tetrahedra will be output.\n\n$tol$: Tolerance to control discarding of closely spaced points. This tolerance is specified as a fraction of the diagonal length of the bounding box of the points.\n\n$offset$: Multiplier to control the size of the initial, bounding Delaunay triangulation.\n\n:::\n\n## 6. Normal Estimation & Orientation \n\nThe aim of this step is to prepare the convex hull point cloud for the surface reconstruction algorithm. For the final mesh to be smooth and free of artifacts, this step has to assign a normal vector point outwards at each point of the convex hull thus defining a clear boundary between \"inner\" and \"outer\" space vis-a-vis the surface.\n\nOne popular method for normal estimation is a KDTree search and for smoothing their orientations a collection of tangent planes is used.\n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## KD Tree Search and Tangent Plane Orientation\n\n$kdtree\\_radius$: \n\n$kdtree\\_max\\_nn$:\n\n$tangent\\_planes\\_n$:\n\n:::\n\n\n## 7. Surface Reconstruction\n\n::: {#2eb12c74 .cell execution_count=2}\n\n::: {.cell-output .cell-output-display}\n![](index_files/figure-html/cell-3-output-1.png){}\n:::\n:::\n\n\n## References\n\n", + "markdown": "---\ntitle: \"3D tessellation of biomolecular cavities\"\nsubtitle: \"Protocol for analyzing the ribosome exit tunnel\"\nbibliography: \"references.bib\"\ncsl: nature.csl\nengine: \"jupyter\"\nauthor:\n - name: \"Artem Kushner\" \n email: \"rtkushner@gmail.com\"\n affiliations:\n - name: KDD Group\n url: \"https://rtviii.xyz/\"\n\n - name: \"Khanh Dao Duc\" \n email: \"kdd@math.ubc.ca\"\n affiliations:\n - name: Department of Mathematics, UBC\n url: \"https://www.math.ubc.ca/\"\n - name: Department of Computer Science, UBC\n url: \"https://www.cs.ubc.ca/\"\n\ndate: \"29 June 2024\"\ncategories: [biology, bioinformatics, surface-reconstruction, computer graphics] \n\ncallout-icon: false\n# format:\n# pdf:\n# include-in-header:\n# text: |\n# \\usepackage{amsmath}\n\nexecute:\n echo: false\n freeze: auto\n pip: [\"pyvista\", \"open3d\", \"scikit-learn\", \"mendeleev\", \"compas\", \"matplotlib\"]\n\n---\n\n\n\n\n\n\n\n\n\n\n## Summary and Background\n\nWe present a protocol to extract the surface of a biomolecular cavity for shape analysis and molecular simulations.\n\nWe apply and illustrate the protocol on the ribosome structure, which contains a subcompartment known as the ribosome exit tunnel. More details on the tunnel features and biological importance can be found in our previous works [@dao2018impact; @dao2019differences]. The protocol was also design to refine the output obtained from MOLE software [@sehnal2013mole] (see Figure XXX)\n\n\n\n\n\n::: {#fig-intro layout=\"[1,1]\" layout-valign=\"bottom\"}\n\n![Structure of the human ribosome (PDB ID: 8OJ0).](./data/8OJ0.gif){#fig-ribo}\n\n![The ribosome exit tunnel (figure from Dao Duc et al. [@dao2019differences])](./data/ptc_and_tunnel_illustration.png){#fig-ribo2}\n\nThe ribosome and the exit tunnel extracted using Mole software [@sehnal2013mole]\n\n:::\n\n\n\n\n## Visual Protocol#\n\n\n![Schematic representation of the tunnel geometry surface reconstruction ](./data/visual_protocol.png){fig-alt=\"\"}\n\n\n\n\n## 1. Bounding Box \n\nThis step captures the subset of atoms enclosing the cavity of interest (the tunnel) from the original structure.\n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## Parameters\n\n$R_{\\mathbf{expansion}}$: define the radius of expansion to be the value added to $R_{x,y,z}$ of the centerline at every $C_{x,y,z}$\n\n$B$: The smallest bounding box containing all of the coordinates formed by the expanded centerline \n\n$R_{\\mathbf{vdw}}$: Van der Waals radius of an atom in Ã…ngstroms. Default to the value of 2.\n\n$pc_{B}$: The pointcloud formed by applying $B$ to the original structure and widening each atom coordinate to include points withing a sphere with radius $R_{\\mathbf{vdw}}$\n\n:::\n\n1. For each $C_{x,y,z}$, capture atoms whose center belongs to the sphere formed by $R_{x,y,z} + R_{\\mathbf{expansion}}$, filter out duplicates. Call this coordinate set the ${\\textit{Centerline Expansion Atoms}}$.\n\n2. Calculate the coordinates of the bounding box $B$ around the $\\text{\\textit{Centerline Expansion Atoms}}$.\n\n3. Apply $B$ to the initial structure to extract all atoms that belong to $B$. Call this $\\mathbf{pc^{B}}$.\n\n4. Widen each coordinate (atom center) $C_{x,y,z}^{pc_{B}}$ inside $pc_{B}$ by $R_{vdw}$ to include a more realistic representation of atoms. This is done by creating a voxel subgrid whose bounding cube is between $(C_{x}^{pc_{B}}-R_{vdw},C_{y}^{pc_{B}}-R_{vdw},C_{z}^{pc_{B}}-R_{vdw})$ and $(C_{x}^{pc_{B}}+R_{vdw},C_{y}^{pc_{B}}+R_{vdw},C_{z}^{pc_{B}}+R_{vdw})$, a cube of indices centered at $C_{x,y,z}^{pc_{B}}$. The resultant coordinate set is $\\mathbf{pc_{B}}$.\n\n5. Anchor the coordinates of the $pc_{B}$ to the origin by subtracting the $\\mu(pc_{B})$ of the coordinate set from each $C_{x,y,z}^{pc_{B}}$ and then shifting each $C_{x,y,z}^{pc_{B}}$ upwards by $|\\min(x,y,z)|$. This is done to reduce the amount of empty voxel cells in the following steps, reduce compute. \n\n## 2. Voxelization\n\n\n1. Assume voxel size of $1$ in correspondence to the units of the dataset, Angstroms in our case. (Alternatively, atom-to-sphere expansion in step **1. Bounding Box** should be accordingly scaled). \n\n2. Create a boolean voxel grid with the dimensions of the ($pc_{B}$ + $1$), call this the $Grid_{index}$ (as opposed to $Grid_{coordinate}$)\n\n3. Set voxels at _index_ [$C_{x},C_{y},C_{z}$] for every $C$ in $pc_{B}$ in the $Grid_{index}$ to $1$. All other voxel are $0$.\n\n\n## 3. Inversion\n\nInvert the $Grid_{index}$ to create a representation of the _\"empty space\"_ inside the exit tunnel. \n\n## 4. DBSCAN\n\nThe aim in this step is to extract only the voxels belonging to the \"empty space\" inside the tunnel and no other. Given that we have a good idea of the Van der Waals radii of the atoms that constitute the walls and have control over the size of the voxel in the $Grid_{index}$, one method that we can apply is DBSCAN. \n\nDBSCAN is a density-based clustering non-parametric algorithm that is akin to UMAP/t-SNE. \n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## DBSCAN Parameters\n\n$eps$: The maximum distance between two samples for one to be considered as in the neighborhood of the other. This is the most important DBSCAN parameter to choose appropriately for your data set and distance function.\n\n$min\\_nbrs$ ($min\\_samples$): If $min\\_samples$ is set to a higher value, DBSCAN will find denser clusters, whereas if it is set to a lower value, the found clusters will be more sparse. The metric to use when calculating distance between instances in a feature array.\n\n$metric$: The metric to use when calculating distance between instances in a feature array. We use the Euclidian distance.\n:::\n\n::: {#1dc521e8 .cell execution_count=1}\n\n::: {.cell-output .cell-output-display}\n![](index_files/figure-html/cell-2-output-1.png){}\n:::\n:::\n\n\n## 5. Interior Surface via Delaunay Triangulation\n\nThe aim of this step is to extract a point cloud containing only the voxels on the surface of convex hull enclosing the interior space of the tunnel.\n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## Delaunay 3D parameters\n\n\n$alpha$: Controls the smoothness of the constructed alpha-shape. For a non-zero $alpha$ value, only vertices, edges, faces, or tetrahedra contained within the circumsphere (of radius $alpha$) will be output. Otherwise, only tetrahedra will be output.\n\n$tol$: Tolerance to control discarding of closely spaced points. This tolerance is specified as a fraction of the diagonal length of the bounding box of the points.\n\n$offset$: Multiplier to control the size of the initial, bounding Delaunay triangulation.\n\n:::\n\n## 6. Normal Estimation & Orientation \n\nThe aim of this step is to prepare the convex hull point cloud for the surface reconstruction algorithm. For the final mesh to be smooth and free of artifacts, this step has to assign a normal vector point outwards at each point of the convex hull thus defining a clear boundary between \"inner\" and \"outer\" space vis-a-vis the surface.\n\nOne popular method for normal estimation is a KDTree search and for smoothing their orientations a collection of tangent planes is used.\n\n:::{.callout-note appearance=\"simple\" collapse=\"true\"}\n\n## KD Tree Search and Tangent Plane Orientation\n\n$kdtree\\_radius$: \n\n$kdtree\\_max\\_nn$:\n\n$tangent\\_planes\\_n$:\n\n:::\n\n\n## 7. Surface Reconstruction\n\n::: {#d380d218 .cell execution_count=2}\n\n::: {.cell-output .cell-output-display}\n![](index_files/figure-html/cell-3-output-1.png){}\n:::\n:::\n\n\n## References\n\n", "supporting": [ "index_files/figure-html" ], diff --git a/about.qmd b/about.qmd index 692676f..abc0a9d 100644 --- a/about.qmd +++ b/about.qmd @@ -1,15 +1,9 @@ --- title: "About" -image: profile.jpg +#image: profile.jpg about: template: jolla links: - - icon: twitter - text: Twitter - href: https://twitter.com - - icon: linkedin - text: LinkedIn - href: https://linkedin.com - icon: github text: Github href: https://github.com