Привіт Гість ( Вхід | Реєстрація )

> Human Proteonome Folding, Phase 2, рассчет структуры белков в человеческом теле
Rilian
Jun 11 2008, 15:33
Пост #1


interstellar
**********

Група: Team member
Повідомлень: 17 162
З нами з: 22-February 06
З: Торонто
Користувач №: 184
Стать: НеСкажу
Free-DC_CPID
Парк машин:
ноут и кусок сервера



IPB Image

Human Proteome Folding Project
Phase 2


Официальные результаты проекта
Активные эксперименты
Human Microbiome Project - официальный сайт

http://homepages.nyu.edu/~rb133/wcg/thread_2010_03_10.html

Как присоединиться читайте в главном топике World Community Grid thumbsup.gif

Proteins are essential to living beings. Just about everything in the human body involves or is made out of proteins.

What are proteins?
Proteins are large molecules that are made of long chains of smaller molecules called amino acids. While there are only 20 different kinds of amino acids that make up all proteins, sometimes hundreds of them make up a single protein.

Adding to the complexity, proteins typically do not stay as long chains. As soon as the chain of amino acids is built, the chain folds and tangles up into a more compact and particular shape that lets it conduct specific and necessary functions within the human body.

Proteins fold because the different amino acids like to stick to each other following certain rules. Imagine that amino acids are pop-beads of 20 different colors. The pop-beads are sticky, but sticky in such a way that only certain combinations of colors can stick together. This makes the amino acid chains fold in a particular way that creates proteins that are useful to the human body. Human cells have mechanisms to help the proteins fold properly and, equally important, mechanisms to get rid of improperly folded proteins.

How do proteins relate to human genes?
The collection of all of the human genes is known as "the human genome." Depending on how the genes are counted, there are over 30,000 genes in the human genome. Each gene, which is a section of a long chain known as DNA, dictates how to build the chain of amino acids for one of the 30,000 proteins. In recent years, scientists were able to map the sequence for each human gene. This means that we now know the sequence of amino acids in all of the human proteins. Thus, the human genome is directly related to the "human proteome," the collection of all human proteins.

The protein mystery
While researchers have learned a great deal about the human proteome, the functions of most of the proteins remain a mystery. The genes do not reveal exactly how the proteins will fold into their final shape, which is critical because that determines what a protein can do and what other proteins it can connect to or interact with.

Proteins are like puzzle pieces. For example, muscle proteins connect to each other to form a muscle fiber. They join together in a specific manner because of their shape, as well as other factors relating to the shape.

Everything that goes on in cells and in the body is very specifically controlled by the shape of the proteins that do or do not let proteins interlock with other proteins. For example, the proteins of a virus or bacteria may have particular shapes that enable it to break through the cell membrane, allowing it to infect the cell.

The Human Proteome Folding Project
Знания структуры белков позволит ученым понять как белки выполняют свои биологические функции, а также как болезни блокируют белки от выполнения необходимых функций для поддержания здоровых клеток

The Human Proteome Folding Project will combine the power of millions of computers in a grid to help scientists understand how human proteins fold. The work to be done in this monumental task is shared across this grid, so that results can be achieved far sooner than would be possible with conventional supercomputers. With a greater understanding of protein structure, scientists can learn how diseases work and ultimately find cures for them.

When your grid agent is running, it is folding an amino acid chain in various ways and evaluating how well each folding follows the specific rules of how specific amino acids stick together or not. As computers try millions of ways to fold the chains, they attempt to fold the protein in the same way that it actually folds in the human body. The best shapes identified for each protein are returned to the scientists for further study.

IPB Image

-----

Оказывается тут тоже юзается розетта huh1.gif
(Show/Hide)

IPB Image


График проекта
IPB Image

Це повідомлення відредагував Rilian: Feb 4 2011, 00:23
User is offlineProfile CardPM
Go to the top of the page
+Quote Post
 
Reply to this topicStart new topic
Відповідей
Rilian
Dec 28 2011, 23:18
Пост #2


interstellar
**********

Група: Team member
Повідомлень: 17 162
З нами з: 22-February 06
З: Торонто
Користувач №: 184
Стать: НеСкажу
Free-DC_CPID
Парк машин:
ноут и кусок сервера



The Human Proteome Folding project research scientists have posted an informative status update on their web site. They highlight their recently published paper in Genome Research and an upcoming paper about the evolution of proteins. Future work is also discussed, including some work which should help the scientific community working on malaria.

6e047365df22.gif You may review their update here.

World Community Grid Post - HPF2 Update, November 2011

Greetings to everyone,

It's been a stretch since the last update, but excitingly (!), we've been quite busy wrapping up ongoing projects with publications, and also getting our teeth into new projects and data. So, without further ado, I'd like to first mention our accepted and pending publications, and then go over the new data we're crunching and where it is leading us.

The lab has been very excited to recently have two gargantuan efforts come to fruition with the acceptance of one paper and the completion and submission of a second. The first, Kevin Drew (et al.)'s, is an enormous work covering nearly everything we do in terms of protein structure and function prediction, and was made conceivable in the first place and achievable in the second by support of World Community Grid computing cycles.

The paper will be available in the journal Genome Research this month (November 2011). The abstract is as follows, and the lab spent extra to ensure an open license so that the paper could be viewed in full - take a look!

The incompleteness of proteome structure and function annotation is a critical problem for biologists and, in particular, severely limits interpretation of high-throughput and next-generation experiments. We have developed a proteome annotation pipeline based on structure prediction, where function and structure annotations are generated using an integration of sequence comparison, fold recognition and grid-computing enabled de novo structure prediction. We predict protein domain boundaries and 3D structures for protein domains from 94 genomes (including Human, Arabidopsis, Rice, Mouse, Fly, Yeast, E. coli and Worm). De novo structure predictions were distributed on a grid of over 1.5 million CPUs worldwide (World Community Grid). We generate significant numbers of new confident fold annotations (9% of domains that are otherwise unannotated in these genomes). We demonstrate that predicted structures can be combined with annotations from the Gene Ontology database to predict new and more specific molecular functions.

The paper can be viewed here: http://genome.cshlp.org/content/early/2011...75.111.abstract

Also, take a quick look at this seminal image from the paper - predicting domain boundaries, and using the grid to do de Novo structure prediction for unknown domains:

IPB Image


The second piece of good news is that another paper involving protein structure folding has recently been submitted for publication.

Melissa Pentony et al. have presented work considering sites of positive selection (areas of faster-than-average evolution) in the proteomes of five major plant species in order to study plant protein evolution, and have extended this analysis in a novel way by mapping sites of positive selection in proteins onto 3D predicted protein structures. This is exciting as, seen in the image below, it allows scientists to visualize where sites of increased evolution occur structurally on a protein.

IPB Image
[Image: a DNA-binding protein interacting with DNA, with positively selected residues of protein highlighted by blue spheres. Notice, then, that the parts of the protein interacting with DNA are under selected evolution!]



This work is currently being revised, and will be available for preview shortly - Another example of the grid producing data (predicted protein structure!) that can be used in diverse biological studies to extend analyses and relate biological phenomena to the fundamental molecular machines of the human body (proteins!).

Now on to what's been grinding on your CPUs...

IPB Image


Processing for the Human Microbiome Project (described in the last update) was finished with batch 'ok', and from there we moved on to Plasmodium Yoelii Yoelii, which made up batches 'ol' through 'op'.

I mentioned the bacteria Plasmodium Yoelii Yoelii in the previous status update and very briefly in my last forum post. Pyy is a rodent malaria used a model organism for studying malaria in general, and specifically human malaria (the concept of using very similar model organisms is common in the field, and is extremely helpful for increasing data set size and inferring properties of an organism from known properties in a model). For this reason, having accurate structural knowledge of Pyy is important for the malaria research community.

Knowing this, we looked up our collaborator Jane Carlton, recently moved to the NYU Department of Biology, and asked for the most up-to-date data. We were pointed to a resource called PlasmoDB ( http://plasmodb.org/plasmo/ ), and from the data we found there put together five batches of novel protein domains to be sent for de Novo structure prediction.

After malaria...

After malaria, while we updated our post-processing analyses to make better use of grid results, we moved on to Archaea, which make up the third domain of life (the other two being bacteria and eukaryotes). Archaea are incredibly interesting and important organisms - they're now getting a lot of press due to their role in the function of the human colonic system, and interestingly, some species are known to thrive in incredibly harsh environments, such as salt lakes and hot springs.

For more information on Archaea, check this Berkeley resource or, of course, wikipedia - Archaea. The archaea Haloferax and Haloarcula comprise batches oq through ow.

Pausing the Archaeas

At the moment, we have a large list of archaea to analyze, but have switched priorities due to some extremely exciting new ideas regarding protein function prediction based on machine learning techniques (which sounds AI-cool, but is more statistics-cool) which we have developed in house, and on revised proteome data for Mouse and Human.

We have decided to re-run this new mouse and human data through our domain prediction pipeline and send results to the grid in order to get the best possible protein structure data. With improvements and updates to our pre- and post-processing methods and increased sampling on the grid (we're now folding 100,000 structures per domain, up from 30,000!), we will be able to approach the problem of protein structure prediction in a novel and potentially game-changing way with the best data available.

In terms of work batches, ox through ql (we skipped the letter p in batch naming) are made up of Mouse protein data, with ox through ql running on the grid now. After ql, new Human data will take over.

The first culmination of this mouse and human redo, along with our new protein function prediction ideas, will be our presence at a nation-wide protein structure/function jamboree hosted by the University of California, San Diego in early December, where we will present the work of the grid and its incorporation into our new methods to hopefully astounding effect!

Cross your fingers for us… koc.gif awesome.png


--------------------
(Show/Hide)


IPB Image

IPB Image

IPB Image
IPB Image

загальна статистика: BOINCstats * FreeDC команда: BOINC команда Ukraine

IPB Image

IPB Image
User is offlineProfile CardPM
Go to the top of the page
+Quote Post

Повідомлення у даній Темі
Rilian   Human Proteonome Folding, Phase 2   Jun 11 2008, 15:33
nikelong   Росетта следит за тобой! ЗЫ: ты когда себе та...   Jun 11 2008, 16:37
(_KoDAk_)   :yes:   Jun 11 2008, 23:33
Rilian   надо бы перевести шапку... Кратко: проект с помощ...   Jan 2 2009, 16:30
Rilian   Насчитал 100 процессорных дней в проекте Это сост...   Jan 4 2009, 19:27
Rilian   Richard Bonneau, head scientist of the Human Prote...   Jan 6 2009, 04:44
Rilian   Рассчитал 500 протеинов за 142 процессорных дня   Jan 11 2009, 04:16
Rilian   Объявляется мини-соревнование - кто быстрее подсчи...   Jan 13 2009, 21:16
Rilian   Объявляется мини-соревнование - кто быстрее подсч...   Mar 6 2009, 22:07
nikelong   http://www.yeastrc.org/pdr/pages/search/ad...dSear...   Feb 13 2009, 15:59
cosmo_vk   кстати не у кого не возникало проблем с расчетом з...   Mar 7 2009, 08:04
Rilian   Не, но у меня на висте проблемы именно с HPFP2...   Mar 7 2009, 15:23
Rilian   Привет, Прошло уже некоторое время с тех пор ка...   Mar 25 2009, 02:10
cosmo_vk   А все-таки тормозятся вычисления, теперь уже такой...   Apr 1 2009, 18:20
Rilian   у меня на 2-гигагерцовых ксеонах бывает считает и ...   Apr 1 2009, 20:50
cosmo_vk   не-е у меня считает в районе 3-4 часов. Если больш...   Apr 2 2009, 06:49
Rilian   А... ну да, в HPFP2 оч редко такое бывает.. Может ...   Apr 2 2009, 11:31
Rilian   Patrick Winters продолжает радовать нас апдейтами ...   Apr 8 2009, 20:54
vitalidze1   cosmo_vk, В мене іноді такі лажі на компах вискан...   May 28 2009, 16:09
cosmo_vk   не-е на рисе у меня все нормально. Глюк с этим пр...   May 29 2009, 16:41
4 Сторінки V  1 2 3 > » 


Reply to this topicStart new topic
1 Користувачів переглядають дану тему (1 Гостей і 0 Прихованих Користувачів)
0 Користувачів:

 



- Lo-Fi Версія Поточний час: 2nd August 2025 - 05:32